JUL 26 2007

SEQUENCE LISTING

<110> Zur Megede, Jan

Barnett, Susan W. 

Engelbrecht, Susan

van Rensburg, Estrelita Janse 

<120> Polynucleotides Encoding Antigenic HIV Type C Polypeptides, 
Polypeptides and Uses Thereof 

<130> PP01631.102 

<140> 09/899,575 
<141> 2001-07-05 

<150> 09/610,313 
<151> 2000-07-05 

<160> 147 

<170> PatentIn version 3.4

<210> 1 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 



<210> 2 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 2 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 



<210> 3 

<211> 1479 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic Gag of HIV strain AF110965 
<400> 3 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60 
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120 
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240 






accgtggcca ccctgtactg cgtgcacgag aagatcgagg tccgcgacac caaggaggcc 300 

ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360 

gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420

gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480 

gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540 

gacctgaaca cgatgttgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600 

gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660 

atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720 

ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780 

aagcggtgga tcatcctggg cctgaacaag atcgtgcgga tgtacagccc cgtgagcatc 840 

ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900 

accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960 

gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ctctcggccc cggcgccagc 1020 

ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080 

ctggccgagg cgatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140 

ggcccccggc gcatcgtcaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200 

tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 

gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 

cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380 

ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440 

ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479

<210> 4 
<211> 1509 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic Gag of HIV strain AF110967 
<400> 4 

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60 

ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 

ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180 






atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240 

accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tccgcgacac caaggaggcc 300 

ctggacaaga tcgaggagga gcagaacaag tcccagcaga agacccagca ggccaaggag 360

gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 

caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 

ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 

ctgaacacga tgttgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 

accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660 

gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 720 

caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780 

cggtggatca tcctgggcct gaacaagatc gtgcggatgt acagccccgt gagcatcctg 840 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 

ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960 

cagaacgcca accccgactg caagaccatc ctgcgcgctc tcggccccgg cgccaccctg 1020 

gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080 

gccgaggcga tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140 

ggcccccggc gcaacgtcaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200

tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 

gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 

cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380 

cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440 

cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500 

agccagtaa 1509 

<210> 5 
<211> 141 
<212> DNA 
<213> Artificial 

<220> 

<223> Env common region of HIV strain AF110968 
<400> 5

accatcacca tcacctgccg catcaagcag atcatcaaca tgtggcagaa ggtgggccgc 60
gccatgtacg ccccccccat cgccggcaac ctgacctgcg agagcaacat caccggcctg 120
ctgctgaccc gcgacggcgg c 141



<210> 6 
<211> 1431 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic gp120 coding region of HIV strain AF110968
<400> 6 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 

aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120 

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 

aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420

gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 

gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 

ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 

aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 

ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720 

atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 

aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 

ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 

tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 

gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020

accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 

aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 

atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200 

gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260

cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320

atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380

ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg c 1431



<210> 7 
<211> 1944 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic gp140 coding region of HIV strain AF110968
<400> 7 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 

aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120 

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 

aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420

gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 

gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 

ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 

aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 

ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720 

atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 

aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 

ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 

tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 

gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020 

accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 

aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 

atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200

gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260

cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320 

atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380 

ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500 

atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560 

ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620 

cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680 

atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740 

agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800 

atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860 

aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920 

atcaccaact ggctgtggta catc 1944 

<210> 8 
<211> 2466 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic gp160 coding region of HIV strain AF110968
<400> 8 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 

aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120 

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 

aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420 

gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 

gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 

ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 

aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660

ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720

atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780 

aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 

ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 

tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 

gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020 

accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 

aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 

atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200 

gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260

cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320 

atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380 

ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500 

atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560 

ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620 

cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680 

atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740 

agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800 

atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860 

aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920 

atcaccaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 1980 

ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2040 

ctgcccttcc agaccctgac ccccaacccc cgcgagcccg accgcctggg ccgcatcgag 2100 

gaggagggcg gcgagcagga ccgcggccgc agcatccgcc tggtgagcgg cttcctggcc 2160 

ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2220 

atcctgatcg ccgcccgcgt gctggagctg ctgggccagc gcggctggga ggccctgaag 2280 

tacctgggca gcctggtgca gtactggggc ctggagctga agaagagcgc catcagcctg 2340

ctggacacca tcgccatcgc cgtggccgag ggcaccgacc gcatcatcga gttcatccag 2400

cgcatctgcc gcgccatccg caacatcccc cgccgcatcc gccagggctt cgaggccgcc 2460

ctgcag 2466



<210> 9 
<211> 2547 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic signal sequence and gp160 coding region of HIV strain
AF110968 

<400> 9 

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60 

ttctggatgc tgatcatcag cagcgtggtg ggcaacctgt gggtgaccgt gtactacggc 120 

gtgcccgtgt ggaaggaggc caagaccacc ctgttctgca ccagcgacgc caaggcctac 180 

gagaccgagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240 

caggagatcg tgctggagaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggtg 300 

gaccagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360 

ctgacccccc tgtgcgtgac cctgaagtgc cgcaacgtga acgccaccaa caacatcaac 420 

agcatgatcg acaacagcaa caagggcgag atgaagaact gcagcttcaa cgtgaccacc 480 

gagctgcgcg accgcaagca ggaggtgcac gccctgttct accgcctgga cgtggtgccc 540 

ctgcagggca acaacagcaa cgagtaccgc ctgatcaact gcaacaccag cgccatcacc 600 

caggcctgcc ccaaggtgag cttcgacccc atccccatcc actactgcac ccccgccggc 660 

tacgccatcc tgaagtgcaa caaccagacc ttcaacggca ccggcccctg caacaacgtg 720 

agcagcgtgc agtgcgccca cggcatcaag cccgtggtga gcacccagct gctgctgaac 780 

ggcagcctgg ccaagggcga gatcatcatc cgcagcgaga acctggccaa caacgccaag 840 

atcatcatcg tgcagctgaa caagcccgtg aagatcgtgt gcgtgcgccc caacaacaac 900 

acccgcaaga gcgtgcgcat cggccccggc cagaccttct acgccaccgg cgagatcatc 960 

ggcgacatcc gccaggccta ctgcatcatc aacaagaccg agtggaacag caccctgcag 1020 

ggcgtgagca agaagctgga ggagcacttc agcaagaagg ccatcaagtt cgagcccagc 1080 

agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140 

tgcgacacca gccagctgtt caacagcacc tacagcccca gcttcaacgg caccgagaac 1200

aagctgaacg gcaccatcac catcacctgc cgcatcaagc agatcatcaa catgtggcag 1260

aaggtgggcc gcgccatgta cgcccccccc atcgccggca acctgacctg cgagagcaac 1320 

atcaccggcc tgctgctgac ccgcgacggc ggcaagaccg gccccaacga caccgagatc 1380 

ttccgccccg gcggcggcga catgcgcgac aactggcgca acgagctgta caagtacaag 1440 

gtggtggaga tcaagcccct gggcgtggcc cccaccgagg ccaagcgccg cgtggtggag 1500 

cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1560 

agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgcctgct gctgagcggc 1620 

atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca cctgctgcag 1680 

ctgaccgtgt ggggcatcaa gcagctgcag acccgcatcc tggccgtgga gcgctacctg 1740 

aaggaccagc agctgctggg catctggggc tgcagcggca agctgatctg caccaccgcc 1800 

gtgccctgga acagcagctg gagcaaccgc agccacgacg agatctggga caacatgacc 1860

tggatgcagt gggaccgcga gatcaacaac tacaccgaca ccatctaccg cctgctggag 1920 

gagagccaga accagcagga gaagaacgag aaggacctgc tggccctgga cagctggcag 1980 

aacctgtgga actggttcag catcaccaac tggctgtggt acatcaagat cttcatcatg 2040 

atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 2100 

gtgcgccagg gctacagccc cctgcccttc cagaccctga cccccaaccc ccgcgagccc 2160 

gaccgcctgg gccgcatcga ggaggagggc ggcgagcagg accgcggccg cagcatccgc 2220 

ctggtgagcg gcttcctggc cctggcctgg gacgacctgc gcagcctgtg cctgttcagc 2280 

taccaccgcc tgcgcgactt catcctgatc gccgcccgcg tgctggagct gctgggccag 2340 

cgcggctggg aggccctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2400 

aagaagagcg ccatcagcct gctggacacc atcgccatcg ccgtggccga gggcaccgac 2460 

cgcatcatcg agttcatcca gcgcatctgc cgcgccatcc gcaacatccc ccgccgcatc 2520 

cgccagggct tcgaggccgc cctgcag 2547 



<210> 10 

<211> 1035 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic a gp41 coding region of HIV strain AF110968 

<400> 10 

gccgtgggca tcggcgccgt gttcctgggc ttcctgggcg ccgccggcag caccatgggc 60

gccgccagca tcaccctgac cgtgcaggcc cgcctgctgc tgagcggcat cgtgcagcag 120

cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 180 

ggcatcaagc agctgcagac ccgcatcctg gccgtggagc gctacctgaa ggaccagcag 240 

ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccgccgt gccctggaac 300 

agcagctgga gcaaccgcag ccacgacgag atctgggaca acatgacctg gatgcagtgg 360 

gaccgcgaga tcaacaacta caccgacacc atctaccgcc tgctggagga gagccagaac 420 

cagcaggaga agaacgagaa ggacctgctg gccctggaca gctggcagaa cctgtggaac 480 

tggttcagca tcaccaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540 

ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600 

tacagccccc tgcccttcca gaccctgacc cccaaccccc gcgagcccga ccgcctgggc 660 

cgcatcgagg aggagggcgg cgagcaggac cgcggccgca gcatccgcct ggtgagcggc 720 

ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780 

cgcgacttca tcctgatcgc cgcccgcgtg ctggagctgc tgggccagcg cggctgggag 840 

gccctgaagt acctgggcag cctggtgcag tactggggcc tggagctgaa gaagagcgcc 900 

atcagcctgc tggacaccat cgccatcgcc gtggccgagg gcaccgaccg catcatcgag 960 

ttcatccagc gcatctgccg cgccatccgc aacatccccc gccgcatccg ccagggcttc 1020 

gaggccgccc tgcag 1035 

<210> 11 
<211> 144 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic Env common region of HIV strain AF110975 
<400> 11 

agcatcatca ccctgccctg ccgcatcaag cagatcatcg acatgtggca gaaggtgggc 60 

cgcgccatct acgccccccc catcgagggc aacatcacct gcagcagcag catcaccggc 120 

ctgctgctgg cccgcgacgg cggc 144 



<210> 12 

<211> 1437 

<212> DNA 

<213> Artificial

<220>

<223> synthetic gp120 coding region of HIV strain AF110975
<400> 12 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 

agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120 

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360 

aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420 

aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 

agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 

gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 

tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 

acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 

ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 

cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 

atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 

caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 

aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020 

ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 

aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 

atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 

gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 

ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 

ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 

cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgc 1437 

<210> 13 
<211> 1950 
<212> DNA 
<213> Artificial

<220>

<223> synthetic gp140 coding region of HIV strain AF110975
<400> 13 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 

agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120 

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360 

aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420 

aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 

agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 

gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 

tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 

acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 

ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 

cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 

atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 

caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 

aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020 

ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 

aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 

atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 

gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 

ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 

ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 

cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440 

gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 

gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560

agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620

atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 

ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740 

agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 

aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 

caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920 

ttcaacatca gcaactggct gtggtacatc 1950 

<210> 14 
<211> 2493 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic gp160 coding region of HIV strain AF110975
<400> 14 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 

agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120

tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 

gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 

atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300 

ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360 

aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420

aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 

agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 

gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 

tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 

acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 

ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 

cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 

atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 

caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 

aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020

ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080

aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 

atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200

gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 

ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 

ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 

cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440 

gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 

gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560 

agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620 

atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 

ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740 

agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 

aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 

caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920 

ttcaacatca gcaactggct gtggtacatc aagatcttca tcatgatcgt gggcggcctg 1980 

atcggcctgc gcatcatctt cgccgtgctg agcatcgtga accgcgtgcg ccagggctac 2040 

agccccctga gcttccagac cctgaccccc aacccccgcg gcctggaccg cctgggccgc 2100 

atcgaggagg agggcggcga gcaggaccgc gaccgcagca tccgcctggt gcagggcttc 2160 

ctggccctgg cctgggacga cctgcgcagc ctgtgcctgt tcagctacca ccgcctgcgc 2220 

gacctgatcc tggtgaccgc ccgcgtggtg gagctgctgg gccgcagcag cccccgcggc 2280

ctgcagcgcg gctgggaggc cctgaagtac ctgggcagcc tggtgcagta ctggggcctg 2340 

gagctgaaga agagcgccac cagcctgctg gacagcatcg ccatcgccgt ggccgagggc 2400 

accgaccgca tcatcgaggt gatccagcgc atctaccgcg ccttctgcaa catcccccgc 2460 

cgcgtgcgcc agggcttcga ggccgccctg cag 2493 

<210> 15 
<211> 2565 
<212> DNA 
<213> Artificial 




<220> 

<223> synthetic signal sequence and gp160 coding region of HIV strain
AF110975 

<400> 15 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 

ttctggatct gcagcggcct gggcaacctg tgggtgaccg tgtacgacgg cgtgcccgtg 120 

tggcgcgagg ccagcaccac cctgttctgc gccagcgacg ccaaggccta cgagaaggag 180 

gtgcacaacg tgtgggccac ccacgcctgc gtgcccaccg accccaaccc ccaggagatc 240 

gagctggaca acgtgaccga gaacttcaac atgtggaaga acgacatggt ggaccagatg 300 

cacgaggaca tcatcagcct gtgggaccag agcctgaagc cccgcgtgaa gctgaccccc 360 

ctgtgcgtga ccctgaagtg caccaactac agcaccaact acagcaacac catgaacgcc 420 

accagctaca acaacaacac caccgaggag atcaagaact gcaccttcaa catgaccacc 480 

gagctgcgcg acaagaagca gcaggtgtac gccctgttct acaagctgga catcgtgccc 540 

ctgaacagca acagcagcga gtaccgcctg atcaactgca acaccagcgc catcacccag 600 

gcctgcccca aggtgagctt cgaccccatc cccatccact actgcgcccc cgccggctac 660 

gccatcctga agtgcaagaa caacaccagc aacggcaccg gcccctgcca gaacgtgagc 720 

accgtgcagt gcacccacgg catcaagccc gtggtgagca cccccctgct gctgaacggc 780 

agcctggccg agggcggcga gatcatcatc cgcagcaaga acctgagcaa caacgcctac 840 

accatcatcg tgcacctgaa cgacagcgtg gagatcgtgt gcacccgccc caacaacaac 900 

acccgcaagg gcatccgcat cggccccggc cagaccttct acgccaccga gaacatcatc 960 

ggcgacatcc gccaggccca ctgcaacatc agcgccggcg agtggaacaa ggccgtgcag 1020 

cgcgtgagcg ccaagctgcg cgagcacttc cccaacaaga ccatcgagtt ccagcccagc 1080 

agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140 

tgcaacacca gcaagctgtt caacagcagc tacaacggca ccagctaccg cggcaccgag 1200 

agcaacagca gcatcatcac cctgccctgc cgcatcaagc agatcatcga catgtggcag 1260 

aaggtgggcc gcgccatcta cgcccccccc atcgagggca acatcacctg cagcagcagc 1320 

atcaccggcc tgctgctggc ccgcgacggc ggcctggaca acatcaccac cgagatcttc 1380 

cgcccccagg gcggcgacat gaaggacaac tggcgcaacg agctgtacaa gtacaaggtg 1440 

gtggagatca agcccctggg cgtggccccc accgaggcca agcgccgcgt ggtggagcgc 1500 

gagaagcgcg ccgtgggcat cggcgccgtg atcttcggct tcctgggcgc cgccggcagc 1560

aacatgggcg ccgccagcat caccctgacc gcccaggccc gccagctgct gagcggcatc 1620

gtgcagcagc agagcaacct gctgcgcgcc atcgaggccc agcagcacat gctgcagctg 1680 

accgtgtggg gcatcaagca gctgcaggcc cgcgtgctgg ccatcgagcg ctacctgaag 1740 

gaccagcagc tgctgggcat ctggggctgc agcggcaagc tgatctgcac caccaccgtg 1800 

ccctggaaca gcagctggag caacaagacc cagggcgaga tctgggagaa catgacctgg 1860 

atgcagtggg acaaggagat cagcaactac accggcatca tctaccgcct gctggaggag 1920 

agccagaacc agcaggagca gaacgagaag gacctgctgg ccctggacag ccgcaacaac 1980 

ctgtggagct ggttcaacat cagcaactgg ctgtggtaca tcaagatctt catcatgatc 2040 

gtgggcggcc tgatcggcct gcgcatcatc ttcgccgtgc tgagcatcgt gaaccgcgtg 2100 

cgccagggct acagccccct gagcttccag accctgaccc ccaacccccg cggcctggac 2160 

cgcctgggcc gcatcgagga ggagggcggc gagcaggacc gcgaccgcag catccgcctg 2220 

gtgcagggct tcctggccct ggcctgggac gacctgcgca gcctgtgcct gttcagctac 2280 

caccgcctgc gcgacctgat cctggtgacc gcccgcgtgg tggagctgct gggccgcagc 2340 

agcccccgcg gcctgcagcg cggctgggag gccctgaagt acctgggcag cctggtgcag 2400 

tactggggcc tggagctgaa gaagagcgcc accagcctgc tggacagcat cgccatcgcc 2460 

gtggccgagg gcaccgaccg catcatcgag gtgatccagc gcatctaccg cgccttctgc 2520 

aacatccccc gccgcgtgcg ccagggcttc gaggccgccc tgcag 2565 

<210> 16 
<211> 1056 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic a gp41 coding region of HIV strain AF110975 
<400> 16 

gccgtgggca tcggcgccgt gatcttcggc ttcctgggcg ccgccggcag caacatgggc 60 

gccgccagca tcaccctgac cgcccaggcc cgccagctgc tgagcggcat cgtgcagcag 120 

cagagcaacc tgctgcgcgc catcgaggcc cagcagcaca tgctgcagct gaccgtgtgg 180 

ggcatcaagc agctgcaggc ccgcgtgctg gccatcgagc gctacctgaa ggaccagcag 240 

ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccaccgt gccctggaac 300 

agcagctgga gcaacaagac ccagggcgag atctgggaga acatgacctg gatgcagtgg 360

gacaaggaga tcagcaacta caccggcatc atctaccgcc tgctggagga gagccagaac 420






cagcaggagc agaacgagaa ggacctgctg gccctggaca gccgcaacaa cctgtggagc 480 

tggttcaaca tcagcaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540 

ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600 

tacagccccc tgagcttcca gaccctgacc cccaaccccc gcggcctgga ccgcctgggc 660 

cgcatcgagg aggagggcgg cgagcaggac cgcgaccgca gcatccgcct ggtgcagggc 720 

ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780 

cgcgacctga tcctggtgac cgcccgcgtg gtggagctgc tgggccgcag cagcccccgc 840 

ggcctgcagc gcggctggga ggccctgaag tacctgggca gcctggtgca gtactggggc 900 

ctggagctga agaagagcgc caccagcctg ctggacagca tcgccatcgc cgtggccgag 960 

ggcaccgacc gcatcatcga ggtgatccag cgcatctacc gcgccttctg caacatcccc 1020 

cgccgcgtgc gccagggctt cgaggccgcc ctgcag 1056 

<210> 17 
<211> 492 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 17 

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Gly Lys Leu Asp Ala Trp
1 5 10 15

Glu Arg Ile Arg Leu Arg Pro Gly Gly Lys Lys Cys Tyr Met Met Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Lys Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ser Glu Gly Cys Lys Gln Ile Ile Arg Gln Leu
50 55 60

His Pro Ala Leu Gln Thr Gly Ser Glu Glu Leu Lys Ser Leu Phe Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Glu Lys Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Cys Gln
100 105 110

Gln Lys Ile Gln Gln Ala Glu Ala Ala Asp Lys Gly Lys Val Ser Gln
115 120 125

Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala
130 135 140

Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys
145 150 155 160

Ala Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly
165 170 175

Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His
180 185 190

Gln Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala
195 200 205

Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala Pro Gly
210 215 220

Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr Ser Thr
225 230 235 240

Leu Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Ile Pro Val
245 250 255

Gly Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val
260 265 270

Arg Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Lys Gln Gly Pro Lys
275 280 285

Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala
290 295 300

Glu Gln Ser Thr Gln Glu Val Lys Asn Trp Met Thr Asp Thr Leu Leu
305 310 315 320

Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly
325 330 335

Pro Gly Ala Ser Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly
340 345 350

Gly Pro Ser His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala
355 360 365

Asn Thr Ser Val Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Ile Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Arg Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Ser Arg
435 440 445

Pro Glu Pro Thr Ala Pro Pro Ala Glu Ser Phe Arg Phe Glu Glu Thr
450 455 460

Thr Pro Gly Gln Lys Gln Glu Ser Lys Asp Arg Glu Thr Leu Thr Ser
465 470 475 480

Leu Lys Ser Leu Phe Gly Asn Asp Pro Leu Ser Gln
485 490



<210> 18
<211> 81
<212> DNA
<213> Artificial

<220>
<223> synthetic signal
<400> 18

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60
ttctggatgc tgatcatcag c 81 



<210> 19 
<211> 72 
<212> DNA

<213> Artificial 



<220> 

<223> synthetic signal sequence of HIV strain AF110975 
<400> 19 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 
ttctggatct gc 72 

<210> 20 

<211> 1479 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic Gag coding sequence of HIV strain AF110965 
<400> 20 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60 

ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120 

ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 

atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240 

accgtggcca ccctgtactg cgtgcacgag aagatcgagg tgcgcgacac caaggaggcc 300

ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360 

gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420 

gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480 

gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540 

gacctgaaca ccatgctgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600 

gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660 

atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720 

ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780 

aagcgctgga tcatcctggg cctgaacaag atcgtgcgca tgtacagccc cgtgagcatc 840 

ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900 

accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960 

gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ccctgggccc cggcgccagc 1020 

ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080 
ctggccgagg ccatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140

ggcccccgcc gcatcgtgaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200 

tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 

gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 

cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380 

ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440 

ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479 

<210> 21 
<211> 1509 
<212> DNA 
<213> Artificial 

<220> 

<223> synthetic Gag coding sequence of HIV strain AF110967 
<400> 21 

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60 

ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 

ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180 

atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240 

accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tgcgcgacac caaggaggcc 300 

ctggacaaga tcgaggagga gcagaacaag agccagcaga agacccagca ggccaaggag 360 

gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 

caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 

ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 

ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 

accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660 

gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 720 

caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780 

cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 

ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960 

cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccaccctg 1020 
gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080

gccgaggcca tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140 

ggcccccgcc gcaacgtgaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200 

tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 

gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 

cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380 

cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440 

cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500 

agccagtaa 1509 



<210> 22 
<211> 502 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 22 

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Glu Lys Leu Asp Lys Trp
1 5 10 15

Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys His Tyr Met Leu Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Gly Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ala Glu Gly Cys Lys Gln Ile Met Lys Gln Leu
50 55 60

Gln Pro Ala Leu Gln Thr Gly Thr Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Ala Gly Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Gln
100 105 110

Gln Lys Thr Gln Gln Ala Lys Glu Ala Asp Gly Lys Val Ser Gln Asn
115 120 125

Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala Ile
130 135 140



Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys Ala
145 150 155 160

Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly Ala
165 170 175

Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gln
180 185 190

Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala Glu
195 200 205

Trp Asp Arg Leu His Pro Val Gln Ala Gly Pro Val Ala Pro Gly Gln
210 215 220

Met Arg Asp Pro Arg Gly Ser Asp Ile Ala Gly Ala Thr Ser Thr Leu
225 230 235 240

Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Val Pro Val Gly
245 250 255

Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val Arg
260 265 270

Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Arg Gln Gly Pro Lys Glu
275 280 285

Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala Glu
290 295 300

Gln Ala Thr Gln Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val
305 310 315 320

Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly Pro
325 330 335

Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly Gly
340 345 350

Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala Asn
355 360 365



Ser Val Asn Ile Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Asn Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Lys Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Asn Arg
435 440 445

Ser Glu Pro Ala Ala Pro Thr Val Pro Thr Ala Pro Pro Ala Glu Ser
450 455 460

Phe Arg Phe Glu Glu Thr Thr Pro Ala Pro Lys Gln Glu Pro Lys Asp
465 470 475 480

Arg Glu Pro Tyr Arg Glu Pro Leu Thr Ala Leu Arg Ser Leu Phe Gly
485 490 495

Ser Gly Pro Leu Ser Gln
500



<210> 23 
<211> 849 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Met Arg Val Met Gly Ile Leu Lys Asn Tyr Gln Gln Trp Trp Met Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Met Leu Ile Ile Ser Ser Val Val Gly Asn
20 25 30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys
35 40 45



Thr Thr Leu Phe Cys Thr Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val
50 55 60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80

Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95

Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
100 105 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125

Lys Cys Arg Asn Val Asn Ala Thr Asn Asn Ile Asn Ser Met Ile Asp
130 135 140

Asn Ser Asn Lys Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr
145 150 155 160

Glu Leu Arg Asp Arg Lys Gln Glu Val His Ala Leu Phe Tyr Arg Leu
165 170 175

Asp Val Val Pro Leu Gln Gly Asn Asn Ser Asn Glu Tyr Arg Leu Ile
180 185 190

Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205

Asp Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Tyr Ala Ile Leu
210 215 220

Lys Cys Asn Asn Gln Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val
225 230 235 240

Ser Ser Val Gln Cys Ala His Gly Ile Lys Pro Val Val Ser Thr Gln
245 250 255

Leu Leu Leu Asn Gly Ser Leu Ala Lys Gly Glu Ile Ile Ile Arg Ser
260 265 270

Glu Asn Leu Ala Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Lys
275 280 285



Pro Val Lys Ile Val Cys Val Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300

Val Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly Glu Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala Tyr Cys Ile Ile Asn Lys Thr Glu Trp Asn
325 330 335

Ser Thr Leu Gln Gly Val Ser Lys Lys Leu Glu Glu His Phe Ser Lys
340 345 350

Lys Ala Ile Lys Phe Glu Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Ser
370 375 380

Gln Leu Phe Asn Ser Thr Tyr Ser Pro Ser Phe Asn Gly Thr Glu Asn
385 390 395 400

Lys Leu Asn Gly Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asn Met Trp Gln Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala
420 425 430

Gly Asn Leu Thr Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445

Asp Gly Gly Lys Thr Gly Pro Asn Asp Thr Glu Ile Phe Arg Pro Gly
450 455 460

Gly Gly Asp Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys
465 470 475 480

Val Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg
485 490 495

Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe
500 505 510



Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile
515 520 525

Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln
530 535 540

Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln
545 550 555 560

Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Thr Arg Ile Leu Ala Val
565 570 575

Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser
580 585 590

Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser
595 600 605

Asn Arg Ser His Asp Glu Ile Trp Asp Asn Met Thr Trp Met Gln Trp
610 615 620

Asp Arg Glu Ile Asn Asn Tyr Thr Asp Thr Ile Tyr Arg Leu Leu Glu
625 630 635 640

Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu
645 650 655

Asp Ser Trp Gln Asn Leu Trp Asn Trp Phe Ser Ile Thr Asn Trp Leu
660 665 670

Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu
675 680 685

Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly
690 695 700

Tyr Ser Pro Leu Pro Phe Gln Thr Leu Thr Pro Asn Pro Arg Glu Pro
705 710 715 720

Asp Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Gly
725 730 735



Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp
740 745 750

Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ile
755 760 765

Leu Ile Ala Ala Arg Val Leu Glu Leu Leu Gly Gln Arg Gly Trp Glu
770 775 780

Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu
785 790 795 800

Lys Lys Ser Ala Ile Ser Leu Leu Asp Thr Ile Ala Ile Ala Val Ala
805 810 815

Glu Gly Thr Asp Arg Ile Ile Glu Phe Ile Gln Arg Ile Cys Arg Ala
820 825 830

Ile Arg Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu
835 840 845

Gln



<210> 24 

<211> 855 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 24 

Met Arg Val Arg Gly Ile Leu Arg Ser Trp Gln Gln Trp Trp Ile Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Ile Cys Ser Gly Leu Gly Asn Leu Trp Val
20 25 30

Thr Val Tyr Asp Gly Val Pro Val Trp Arg Glu Ala Ser Thr Thr Leu
35 40 45

Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val
50 55 60

Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile
65 70 75 80



Glu Leu Asp Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met
85 90 95

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
100 105 110

Lys Pro Arg Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys Thr
115 120 125

Asn Tyr Ser Thr Asn Tyr Ser Asn Thr Met Asn Ala Thr Ser Tyr Asn
130 135 140

Asn Asn Thr Thr Glu Glu Ile Lys Asn Cys Thr Phe Asn Met Thr Thr
145 150 155 160

Glu Leu Arg Asp Lys Lys Gln Gln Val Tyr Ala Leu Phe Tyr Lys Leu
165 170 175

Asp Ile Val Pro Leu Asn Ser Asn Ser Ser Glu Tyr Arg Leu Ile Asn
180 185 190

Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp
195 200 205

Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys
210 215 220

Cys Lys Asn Asn Thr Ser Asn Gly Thr Gly Pro Cys Gln Asn Val Ser
225 230 235 240

Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Pro Leu
245 250 255

Leu Leu Asn Gly Ser Leu Ala Glu Gly Gly Glu Ile Ile Ile Arg Ser
260 265 270

Lys Asn Leu Ser Asn Asn Ala Tyr Thr Ile Ile Val His Leu Asn Asp
275 280 285

Ser Val Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly
290 295 300



Ile Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Glu Asn Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Ala Gly Glu Trp Asn
325 330 335

Lys Ala Val Gln Arg Val Ser Ala Lys Leu Arg Glu His Phe Pro Asn
340 345 350

Lys Thr Ile Glu Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser
370 375 380

Lys Leu Phe Asn Ser Ser Tyr Asn Gly Thr Ser Tyr Arg Gly Thr Glu
385 390 395 400

Ser Asn Ser Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asp Met Trp Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu
420 425 430

Gly Asn Ile Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg
435 440 445

Asp Gly Gly Leu Asp Asn Ile Thr Thr Glu Ile Phe Arg Pro Gln Gly
450 455 460

Gly Asp Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys Val
465 470 475 480

Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg Arg
485 490 495

Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Ile Phe
500 505 510

Gly Phe Leu Gly Ala Ala Gly Ser Asn Met Gly Ala Ala Ser Ile Thr
515 520 525



Leu Thr Ala Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln
530 535 540

Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu
545 550 555 560

Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu
565 570 575

Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly
580 585 590

Lys Leu Ile Cys Thr Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn
595 600 605

Lys Thr Gln Gly Glu Ile Trp Glu Asn Met Thr Trp Met Gln Trp Asp
610 615 620

Lys Glu Ile Ser Asn Tyr Thr Gly Ile Ile Tyr Arg Leu Leu Glu Glu
625 630 635 640

Ser Gln Asn Gln Gln Glu Gln Asn Glu Lys Asp Leu Leu Ala Leu Asp
645 650 655

Ser Arg Asn Asn Leu Trp Ser Trp Phe Asn Ile Ser Asn Trp Leu Trp
660 665 670

Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg
675 680 685

Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr
690 695 700

Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn Pro Arg Gly Leu Asp
705 710 715 720

Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Asp Arg
725 730 735

Ser Ile Arg Leu Val Gln Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu
740 745 750



Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Ile Leu
755 760 765

Val Thr Ala Arg Val Val Glu Leu Leu Gly Arg Ser Ser Pro Arg Gly
770 775 780

Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln
785 790 795 800

Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Thr Ser Leu Leu Asp Ser
805 810 815

Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Val Ile
820 825 830

Gln Arg Ile Tyr Arg Ala Phe Cys Asn Ile Pro Arg Arg Val Arg Gln
835 840 845

Gly Phe Glu Ala Ala Leu Gln
850 855



<210> 25 

<211> 20 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 25 

Asp Ile Lys Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15



Phe Phe Lys Thr 
20 



<210> 26 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 26 

gacataaaac aaggaccaaa agagcccttt agagactatg tagaccggtt ctttaaaacc 60



<210> 27 
<211> 20 
<212> PRT

<213> Human immunodeficiency virus 



<400> 27 

Asp Ile Arg Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15



Phe Phe Lys Thr 
20 



<210> 28 
<211> 47 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 28 

Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
1 5 10 15

Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Leu Thr
20 25 30

Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
35 40 45 



<210> 29 
<211> 48 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asp Met Trp
1 5 10 15

Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu Gly Asn Ile
20 25 30

Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly
35 40 45 



<210> 30 

<211> 2469 

<212> DNA 

<213> Artificial 

<220> 
<223> PR975(+)
<400> 30 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 

cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 

cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 

ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 

ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 

agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 

ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 

ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 

atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 

gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 

ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 

accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 

gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 

atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 

aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 

ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 

cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 

agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 

aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 

cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 

gagatcgtga tctaccagta catggacgac ctgtacgtgg gcagcgacct ggagatcggc 1260

cagcaccgcg ccaagatcga ggagctgcgc aagcacctgc tgcgctgggg cttcaccacc 1320 

cccgacaaga agcaccagaa ggagcccccc ttcctgtgga tgggctacga gctgcacccc 1380 

gacaagtgga ccgtgcagcc catcgagctg cccgagaagg agagctggac cgtgaacgac 1440 

atccagaagc tggtgggcaa gctgaactgg gccagccaga tctaccccgg catcaaggtg 1500 

cgccagctgt gcaagctgct gcgcggcgcc aaggccctga ccgacatcgt gcccctgacc 1560 

gaggaggccg agctggagct ggccgagaac cgcgagatcc tgcgcgagcc cgtgcacggc 1620 
gtgtactacg accccagcaa ggacctggtg gccgagatcc agaagcaggg ccacgaccag 1680

tggacctacc agatctacca ggagcccttc aagaacctga agaccggcaa gtacgccaag 1740 

atgcgcaccg cccacaccaa cgacgtgaag cagctgaccg aggccgtgca gaagatcgcc 1800 

atggagagca tcgtgatctg gggcaagacc cccaagttcc gcctgcccat ccagaaggag 1860 

acctgggaga cctggtggac cgactactgg caggccacct ggatccccga gtgggagttc 1920 

gtgaacaccc cccccctggt gaagctgtgg taccagctgg agaaggagcc catcatcggc 1980 

gccgagacct tctacgtgga cggcgccgcc aaccgcgaga ccaagatcgg caaggccggc 2040 

tacgtgaccg accggggccg gcagaagatc gtgagcctga ccgagaccac caaccagaag 2100 

accgagctgc aggccatcca gctggccctg caggacagcg gcagcgaggt gaacatcgtg 2160 

accgacagcc agtacgccct gggcatcatc caggcccagc ccgacaagag cgagagcgag 2220 

ctggtgaacc agatcatcga gcagctgatc aagaaggaga aggtgtacct gagctgggtg 2280 

cccgcccaca agggcatcgg cggcaacgag cagatcgaca agctggtgag caagggcatc 2340 

cgcaaggtgc tgttcctgga cggcatcgat ggcggcatcg tgatctacca gtacatggac 2400 

gacctgtacg tgggcagcgg cggccctagg atcgattaaa agcttcccgg ggctagcacc 2460 

ggtgaattc 2469 

<210> 31 
<211> 2463 
<212> DNA 
<213> Artificial 

<220> 

<223> PR975YM 
<400> 31 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 

cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 

cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 

ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 

ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 

agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 

ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 

ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 

atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600

ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 

accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 

gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 

atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 

aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 

ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 

cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 

agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 

aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 

cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 

gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260 

cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320 

aagaagcacc agaaggagcc ccccttcctg tggatgggct acgagctgca ccccgacaag 1380 

tggaccgtgc agcccatcga gctgcccgag aaggagagct ggaccgtgaa cgacatccag 1440 

aagctggtgg gcaagctgaa ctgggccagc cagatctacc ccggcatcaa ggtgcgccag 1500 

ctgtgcaagc tgctgcgcgg cgccaaggcc ctgaccgaca tcgtgcccct gaccgaggag 1560 

gccgagctgg agctggccga gaaccgcgag atcctgcgcg agcccgtgca cggcgtgtac 1620 

tacgacccca gcaaggacct ggtggccgag atccagaagc agggccacga ccagtggacc 1680 

taccagatct accaggagcc cttcaagaac ctgaagaccg gcaagtacgc caagatgcgc 1740 

accgcccaca ccaacgacgt gaagcagctg accgaggccg tgcagaagat cgccatggag 1800 

agcatcgtga tctggggcaa gacccccaag ttccgcctgc ccatccagaa ggagacctgg 1860 

gagacctggt ggaccgacta ctggcaggcc acctggatcc ccgagtggga gttcgtgaac 1920 

accccccccc tggtgaagct gtggtaccag ctggagaagg agcccatcat cggcgccgag 1980 

accttctacg tggacggcgc cgccaaccgc gagaccaaga tcggcaaggc cggctacgtg 2040 

accgaccggg gccggcagaa gatcgtgagc ctgaccgaga ccaccaacca gaagaccgag 2100 

ctgcaggcca tccagctggc cctgcaggac agcggcagcg aggtgaacat cgtgaccgac 2160 

agccagtacg ccctgggcat catccaggcc cagcccgaca agagcgagag cgagctggtg 2220 
aaccagatca tcgagcagct gatcaagaag gagaaggtgt acctgagctg ggtgcccgcc 2280

cacaagggca tcggcggcaa cgagcagatc gacaagctgg tgagcaaggg catccgcaag 2340 

gtgctgttcc tggacggcat cgatggcggc atcgtgatct accagtacat ggacgacctg 2400 

tacgtgggca gcggcggccc taggatcgat taaaagcttc ccggggctag caccggtgaa 2460 

ttc 2463 

<210> 32 
<211> 2457 
<212> DNA 
<213> Artificial 

<220> 

<223> PR975YMWM 
<400> 32 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 

cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 

cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 

ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 

ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 

agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 

ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 

ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 

atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 

gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 

ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 

accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 

gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 

atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 

aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 

ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 

cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020

agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 

aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 






cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 

gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260 

cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320 

aagaagcacc agaaggagcc ccccttcctg cccatcgagc tgcaccccga caagtggacc 1380 

gtgcagccca tcgagctgcc cgagaaggag agctggaccg tgaacgacat ccagaagctg 1440 

gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 1500 

aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 1560 

ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 1620 

cccagcaagg acctggtggc cgagatccag aagcagggcc acgaccagtg gacctaccag 1680 

atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccgcc 1740 

cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1800 

gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1860 

tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 1920 

cccctggtga agctgtggta ccagctggag aaggagccca tcatcggcgc cgagaccttc 1980 

tacgtggacg gcgccgccaa ccgcgagacc aagatcggca aggccggcta cgtgaccgac 2040 

cggggccggc agaagatcgt gagcctgacc gagaccacca accagaagac cgagctgcag 2100 

gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 2160 

tacgccctgg gcatcatcca ggcccagccc gacaagagcg agagcgagct ggtgaaccag 2220 

atcatcgagc agctgatcaa gaaggagaag gtgtacctga gctgggtgcc cgcccacaag 2280 

ggcatcggcg gcaacgagca gatcgacaag ctggtgagca agggcatccg caaggtgctg 2340 

ttcctggacg gcatcgatgg cggcatcgtg atctaccagt acatggacga cctgtacgtg 2400 

ggcagcggcg gccctaggat cgattaaaag cttcccgggg ctagcaccgg tgaattc 2457 

<210> 33 
<211> 9781 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 33 

tggaagggtt aatttactcc aagaaaaggc aagaaatcct tgatttgtgg gtctatcaca 60 

cacaaggctt cttccctgat tggcaaaact acacaccggg gccaggggtc agatatccac 120 

tgacctttgg atggtgctac aagctagtgc cagttgaccc aggggaggtg gaagaggcca 180 
acggaggaga agacaactgt ttgctacacc ctatgagcca acatggagca gaggatgaag 240

atagagaagt attaaagtgg aagtttgaca gcctcctagc acgcagacac atggcccgcg 300 

agctacatcc ggagtattac aaagactgct gacacagaag ggactttccg cctgggactt 360 

tccactgggg cgttccggga ggtgtggtct gggcgggact tgggagtggt caaccctcag 420 

atgctgcata taagcagctg cttttcgcct gtactgggtc tctctcggta gaccagatct 480 

gagcctggga gccctctggc tatctaggga acccactgct taagcctcaa taaagcttgc 540 

cttgagtgct ttaagtagtg tgtgcccatc tgttgtgtga ctctggtaac tagagatccc 600 

tcagaccctt tgtggtagtg tggaaaatct ctagcagtgg cgcccgaaca gggaccagaa 660 

agtgaaagtg agaccagagg agatctctcg acgcaggact cggcttgctg aagtgcacac 720 

ggcaagaggc gagaggggcg gctggtgagt acgccaattt tacttgacta gcggaggcta 780 

gaaggagaga gatgggtgcg agagcgtcaa tattaagcgg cggaaaatta gataaatggg 840 

aaagaattag gttaaggcca gggggaaaga aacattatat gttaaaacat ctagtatggg 900 

caagcaggga gctggaaaga tttgcactta accctggcct gttagaaaca tcagaaggct 960 

gtaaacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag gaacttagat 1020 

cattattcaa cacagtagca actctctatt gtgtacataa agggatagag gtacgagaca 1080 

ccaaggaagc cttagacaag atagaggaag aacaaaacaa atgtcagcaa aaagcacaac 1140 

aggcaaaagc agctgacgaa aaggtcagtc aaaattatcc tatagtacag aatgcccaag 1200 

ggcaaatggt acaccaagct atatcaccta gaacattgaa tgcatggata aaagtaatag 1260 

aggaaaaggc tttcaatcca gaggaaatac ccatgtttac agcattatca gaaggagcca 1320 

ccccacaaga tttaaacaca atgttaaata cagtgggggg acatcaagca gccatgcaaa 1380 

tgttaaaaga taccatcaat gaggaggctg cagaatggga taggacacat ccagtacatg 1440 

cagggcctgt tgcaccaggc cagatgagag aaccaagggg aagtgacata gcaggaacta 1500 

ctagtaccct tcaggaacaa atagcatgga tgacaagtaa tccacctatt ccagtagaag 1560 

acatctataa aagatggata attctggggt taaataaaat agtaagaatg tatagccctg 1620 

ttagcatttt ggacataaaa caagggccaa aagaaccctt tagagactat gtagaccggt 1680 

tctttaaaac cttaagagct gaacaagcta cacaagatgt aaagaattgg atgacagaca 1740 

ccttgttggt ccaaaatgcg aacccagatt gtaagaccat tttaagagca ttaggaccag 1800 

gggcctcatt agaagaaatg atgacagcat gtcagggagt gggaggacct agccataaag 1860 
caagagtgtt ggctgaggca atgagccaag caaacagtaa catactagtg cagagaagca 1920

attttaaagg ctctaacaga attattaaat gtttcaactg tggcaaagta gggcacatag 1980 

ccagaaattg cagggcccct aggaaaaagg gctgttggaa atgtggacag gaaggacacc 2040 

aaatgaaaga ctgtactgag aggcaggcta attttttagg gaaaatttgg ccttcccaca 2100 

aggggaggcc agggaatttc ctccagaaca gaccagagcc aacagcccca ccagcagaac 2160 

caacagcccc accagcagag agcttcaggt tcgaggagac aacccccgtg ccgaggaagg 2220 

agaaagagag ggaaccttta acttccctca aatcactctt tggcagcgac cccttgtctc 2280 

aataaaagta gagggccaga taaaggaggc tctcttagac acaggagcag atgatacagt 2340 

attagaagaa atagatttgc cagggaaatg gaaaccaaaa atgatagggg gaattggagg 2400 

ttttatcaaa gtaagacagt atgatcaaat acttatagaa atttgtggaa aaaaggctat 2460 

aggtacagta ttagtagggc ctacaccagt caacataatt ggaagaaatc tgttaactca 2520 

gcttggatgc acactaaatt ttccaattag tcctattgaa actgtaccag taaaattaaa 2580 

accaggaatg gatggcccaa aggtcaaaca atggccattg acagaagaaa aaataaaagc 2640 

attaacagca atttgtgagg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga 2700 

taatccatat aacactccag tatttgccat aaaaaagaag gacagtacta agtggagaaa 2760 

attagtagat ttcagggaac tcaataaaag aactcaagac ttttgggaag ttcaattagg 2820 

aataccacac ccagcaggat taaaaaagaa aaaatcagtg acagtgctag atgtggggga 2880 

tgcatatttt tcagttcctt tagatgaaag cttcaggaaa tatactgcat tcaccatacc 2940 

tagtataaac aatgaaacac cagggattag atatcaatat aatgtgctgc cacagggatg 3000 

gaaaggatca ccagcaatat tccagagtag catgacaaaa atcttagagc ccttcagagc 3060 

aaaaaatcca gacatagtta tctatcaata tatggatgac ttgtatgtag gatctgactt 3120 

agaaataggg caacatagag caaaaataga agagttaagg gaacatttat tgaaatgggg 3180 

atttacaaca ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggtatga 3240 

actccatcct gacaaatgga cagtacaacc tatactgctg ccagaaaagg atagttggac 3300 

tgtcaatgat atacagaagt tagtgggaaa attaaactgg gcaagtcaga tttacccagg 3360 

gattaaagta aggcaactct gtaaactcct caggggggcc aaagcactaa cagacatagt 3420 

accactaact gaagaagcag aattagaatt ggcagagaac agggaaattt taagagaacc 3480 

agtacatgga gtatattatg atccatcaaa agacttgata gctgaaatac agaaacaggg 3540 

gcatgaacaa tggacatatc aaatttatca agaaccattt aaaaatctga aaacagggaa 3600 

gtatgcaaaa atgaggacta cccacactaa tgatgtaaaa cagttaacag aggcagtgca 3660 

aaaaatagcc atggaaagca tagtaatatg gggaaagact cctaaattta gactacccat 3720 

ccaaaaagaa acatgggaga catggtggac agactattgg caagccacct ggatccctga 3780 

gtgggagttt gttaataccc ctcccctagt aaaattatgg taccaactag aaaaagatcc 3840 

catagcagga gtagaaactt tctatgtaga tggagcaact aatagggaag ctaaaatagg 3900 

aaaagcaggg tatgttactg acagaggaag gcagaaaatt gttactctaa ctaacacaac 3960 

aaatcagaag actgagttac aagcaattca gctagctctg caggattcag gatcagaagt 4020 

aaacatagta acagactcac agtatgcatt aggaatcatt caagcacaac cagataagag 4080 

tgactcagag atatttaacc aaataataga acagttaata aacaaggaaa gaatctacct 4140 

gtcatgggta ccagcacata aaggaattgg gggaaatgaa caagtagata aattagtaag 4200

taagggaatt aggaaagtgt tgtttctaga tggaatagat aaagctcaag aagagcatga 4260

aaggtaccac agcaattgga gagcaatggc taatgagttt aatctgccac ccatagtagc 4320 

aaaagaaata gtagctagct gtgataaatg tcagctaaaa ggggaagcca tacatggaca 4380 

agtcgactgt agtccaggga tatggcaatt agattgtacc catttagagg gaaaaatcat 4440 

cctggtagca gtccatgtag ctagtggcta catggaagca gaggttatcc cagcagaaac 4500 

aggacaagaa acagcatatt ttatattaaa attagcagga agatggccag tcaaagtaat 4560 

acatacagac aatggcagta attttaccag tactgcagtt aaggcagcct gttggtgggc 4620 

aggtatccaa caggaatttg gaattcccta caatccccaa agtcagggag tggtagaatc 4680 

catgaataaa gaattaaaga aaataatagg acaagtaaga gatcaagctg agcaccttaa 4740 

gacagcagta caaatggcag tattcattca caattttaaa agaaaagggg gaattggggg 4800 

gtacagtgca ggggaaagaa taatagacat aatagcaaca gacatacaaa ctaaagaatt 4860 

acaaaaacaa attataagaa ttcaaaattt tcgggtttat tacagagaca gcagagaccc 4920 

tatttggaaa ggaccagccg aactactctg gaaaggtgaa ggggtagtag taatagaaga 4980 

taaaggtgac ataaaggtag taccaaggag gaaagcaaaa atcattagag attatggaaa 5040 

acagatggca ggtgctgatt gtgtggcagg tggacaggat gaagattaga gcatggaata 5100 

gtttagtaaa gcaccatatg tatatatcaa ggagagctag tggatgggtc tacagacatc 5160 

attttgaaag cagacatcca aaagtaagtt cagaagtaca tatcccatta ggggatgcta 5220 

gattagtaat aaaaacatat tggggtttgc agacaggaga aagagattgg catttgggtc 5280
atggagtctc catagaatgg agactgagag aatacagcac acaagtagac cctgacctgg 5340

cagaccagct aattcacatg cattattttg attgttttac agaatctgcc ataagacaag 5400 

ccatattagg acacatagtt tttcctaggt gtgactatca agcaggacat aagaaggtag 5460 

gatctctgca atacttggca ctgacagcat tgataaaacc aaaaaagaga aagccacctc 5520 

tgcctagtgt tagaaaatta gtagaggata gatggaacga cccccagaag accaggggcc 5580 

gcagagggaa ccatacaatg aatggacact agagattcta gaagaactca agcaggaagc 5640 

tgtcagacac tttcctagac catggctcca tagcttagga caatatatct atgaaaccta 5700

tggggatact tggacgggag ttgaagctat aataagagta ctgcaacaac tactgttcat 5760 

tcatttcaga attggatgcc aacatagcag aataggcatc ttgcgacaga gaagagcaag 5820 

aaatggagcc agtagatcct aaactaaagc cctggaacca tccaggaagc caacctaaaa 5880 

cagcttgtaa taattgcttt tgcaaacact gtagctatca ttgtctagtt tgctttcaga 5940 

caaaaggttt aggcatttcc tatggcagga agaagcggag acagcgacga agcgctcctc 6000 

caagtggtga agatcatcaa aatcctctat caaagcagta agtacacata gtagatgtaa 6060 

tggtaagttt aagtttattt aaaggagtag attatagatt aggagtagga gcattgatag 6120 

tagcactaat catagcaata atagtgtgga ccatagcata tatagaatat aggaaattgg 6180 

taagacaaaa gaaaatagac tggttaatta aaagaattag ggaaagagca gaagacagtg 6240 

gcaatgagag tgatggggac acagaagaat tgtcaacaat ggtggatatg gggcatctta 6300 

ggcttctgga tgctaatgat ttgtaacacg gaggacttgt gggtcacagt ctactatggg 6360

gtacctgtgt ggagagaagc aaaaactact ctattctgtg catcagatgc taaagcatat 6420 

gagacagaag tgcataatgt ctgggctaca catgcttgtg tacccacaga ccccaaccca 6480 

caagaaatag ttttgggaaa tgtaacagaa aattttaata tgtggaaaaa taacatggca 6540 

gatcagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaag 6600 

ttgaccccac tctgtgtcac tttaaactgt acagatacaa atgttacagg taatagaact 6660 

gttacaggta atacaaatga taccaatatt gcaaatgcta catataagta tgaagaaatg 6720 

aaaaattgct ctttcaatgc aaccacagaa ttaagagata agaaacataa agagtatgca 6780 

ctcttttata aacttgatat agtaccactt aatgaaaata gtaacaactt tacatataga 6840 

ttaataaatt gcaatacctc aaccataaca caagcctgtc caaaggtctc ttttgacccg 6900 

attcctatac attactgtgc tccagctgat tatgcgattc taaagtgtaa taataagaca 6960 

ttcaatggga caggaccatg ttataatgtc agcacagtac aatgtacaca tggaattaag 7020 
ccagtggtat caactcaact actgttaaat ggtagtctag cagaagaagg gataataatt 7080

agatctgaaa atttgacaga gaataccaaa acaataatag tacatcttaa tgaatctgta 7140 

gagattaatt gtacaaggcc caacaataat acaaggaaaa gtgtaaggat aggaccagga 7200 

caagcattct atgcaacaaa tgacgtaata ggaaacataa gacaagcaca ttgtaacatt 7260

agtacagata gatggaataa aactttacaa caggtaatga aaaaattagg agagcatttc 7320 

cctaataaaa caataaaatt tgaaccacat gcaggagggg atctagaaat tacaatgcat 7380

agctttaatt gtagaggaga atttttctat tgcaatacat caaacctgtt taatagtaca 7440 

tactacccta agaatggtac atacaaatac aatggtaatt caagcttacc catcacactc 7500 

caatgcaaaa taaaacaaat tgtacgcatg tggcaagggg taggacaagc aatgtatgcc 7560 

cctcccattg caggaaacat aacatgtaga tcaaacatca caggaatact attgacacgt 7620 

gatgggggat ttaacaacac aaacaacgac acagaggaga cattcagacc tggaggagga 7680 

gatatgaggg ataactggag aagtgaatta tataaatata aagtggtaga aattaagcca 7740 

ttgggaatag cacccactaa ggcaaaaaga agagtggtgc agagaaaaaa aagagcagtg 7800 

ggaataggag ctgtgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 7860 

tcaataacgc tgacggtaca ggccagacaa ctgttgtctg gtatagtgca acagcaaagc 7920 

aatttgctga aggctataga ggcgcaacag catatgttgc aactcacagt ctggggcatt 7980 

aagcagctcc aggcgagagt cctggctata gaaagatacc taaaggatca acagctccta 8040 

gggatttggg gctgctctgg aagactcatc tgcaccactg ctgtgccttg gaactccagt 8100 

tggagtaata aatctgaagc agatatttgg gataacatga cttggatgca gtgggataga 8160 

gaaattaata attacacaga aacaatattc aggttgcttg aagactcgca aaaccagcag 8220 

gaaaagaatg aaaaagattt attagaattg gacaagtgga ataatctgtg gaattggttt 8280 

gacatatcaa actggctgtg gtatataaaa atattcataa tgatagtagg aggcttgata 8340 

ggtttaagaa taatttttgc tgtgctctct atagtgaata gagttaggca gggatactca 8400 

cctttgtcat ttcagaccct taccccaagc ccgaggggac tcgacaggct cggaggaatc 8460 

gaagaagaag gtggagagca agacagagac agatccatac gattggtgag cggattcttg 8520 

tcgcttgcct gggacgatct gcggagcctg tgcctcttca gctaccaccg cttgagagac 8580 

ttcatattaa ttgcagtgag ggcagtggaa cttctgggac acagcagtct caggggacta 8640 

cagagggggt gggagatcct taagtatctg ggaagtcttg tgcagtattg gggtctagag 8700
ctaaaaaaga gtgctattag tccgcttgat accatagcaa tagcagtagc tgaaggaaca 8760

gataggatta tagaattggt acaaagaatt tgtagagcta tcctcaacat acctaggaga 8820 

ataagacagg gctttgaagc agctttgcta taaaatggga ggcaagtggt caaaacgcag 8880 

catagttgga tggcctgcag taagagaaag aatgagaaga actgagccag cagcagaggg 8940 

agtaggagca gcgtctcaag acttagatag acatggggca cttacaagca gcaacacacc 9000 

tgctactaat gaagcttgtg cctggctgca agcacaagag gaggacggag atgtaggctt 9060 

tccagtcaga cctcaggtac ctttaagacc aatgacttat aagagtgcag tagatctcag 9120 

cttcttttta aaagaaaagg ggggactgga agggttaatt tactctagga aaaggcaaga 9180 

aatccttgat ttgtgggtct ataacacaca aggcttcttc cctgattggc aaaactacac 9240 

atcggggcca ggggtccgat tcccactgac ctttggatgg tgcttcaagc tagtaccagt 9300 

tgacccaagg gaggtgaaag aggccaatga aggagaagac aactgtttgc tacaccctat 9360 

gagccaacat ggagcagagg atgaagatag agaagtatta aagtggaagt ttgacagcct 9420 

tctagcacac agacacatgg cccgcgagct acatccggag tattacaaag actgctgaca 9480 

cagaagggac tttccgcctg ggactttcca ctggggcgtt ccgggaggtg tggtctgggc 9540 

gggacttggg agtggtcacc ctcagatgct gcatataagc agctgctttt cgcttgtact 9600 

gggtctctct cggtagacca gatctgagcc tgggagctct ctggctatct agggaaccca 9660 

ctgcttaggc ctcaataaag cttgccttga gtgctctaag tagtgtgtgc ccatctgttg 9720 

tgtgactctg gtaactagag atccctcaga ccctttgtgg tagtgtggaa aatctctagc 9780 

a 9781 

<210> 34 
<211> 203 
<212> DNA 

<213> Human immunodeficiency virus
<400> 34 

gctgaggcaa tgagccaagc aaccagcgca aacatactga tgcagagaag caatttcaaa 60 

ggccctaaaa gaattattaa atgtttcaac tgtggcaagg aagggcacat agctagaaat 120 

tgtagggccc ctaggaaaaa aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 180 

gactgtactg agaggcaggc taa 203 



<210> 35 
<211> 2151 
<212> DNA 
<213> Human immunodeficiency virus
<400> 35 

ttttttaggg aagatttggc cttcccacaa gggaaggcca gggaatttcc ttcagaacag 60 

aacagagcca acagccccac cagcagagag cttcaagttc gaggagacaa cccccgctcc 120 

gaagcaggag ccgaaagaca gggaaccctt aatttccctc aaatcactct ttggcagcga 180 

ccccttgtct caataaaagt agggggtcaa ataaaggagg ctctcttaga cacaggagct 240 

gatgatacag tattagaaga aatgagtttg ccaggaaaat ggaaaccaaa aatgatagga 300 

ggaattggag gttttatcaa agtaagacag tatgatcaaa tacttataga aatttgtgga 360

aaaaaggcta taggtacagt attaatagga cctacacctg tcaacataat tggaaggaat 420 

atgttgactc agcttggatg cacactaaat tttccaatta gtcccattga aactgtgcca 480 

gtaaaattaa agccaggaat ggatggccca aaggttaaac aatggccatt gacagaagag 540 

aaaataaaag cattaacagc aatttgtgaa gaaatggaga aagaaggaaa aattacaaaa 600 

attgggcctg aaaatccata taacactcca gtatttgcca taaaaaagaa ggacagtact 660 

aagtggagaa agttagtaga tttcagggaa cttaataaaa gaactcaaga cttttgggaa 720 

gttcaattag gaataccaca cccagcaggg ttaaaaaaga aaaaatcagt gacagtactg 780 

gatgtggggg atgcatattt ttcagttcct ttagatgagg acttcaggaa atatactgca 840 

ttcaccatac ctagtataaa caatgaaaca ccagggatta gatatcaata taatgtgctt 900 

ccacagggat ggaaaggatc accatcaata ttccagagta gcatgacaaa aatcttagag 960 

ccctttagag caagaaatcc agaaatagtc atctatcaat atatggatga cttgtatgta 1020 

ggatctgact tagaaatagg gcaacataga gcaaaaatag aggagttaag aaaacatctg 1080 

ttaaggtggg gatttaccac accggacaag aaacatcaga aagaaccccc atttctttgg 1140 

atggggtatg aactccatcc tgacaaatgg acagtacagc ctatagagtt gccagaaaag 1200 

gaaagctgga ctgtcaatga tatacagaag ttagtgggaa aattaaattg ggccagtcag 1260

atttacccag gaattaaagt aaggcaactt tgtaaactcc ttaggggggc caaagcacta 1320 

acagatatag taccactaac tgaagaagca gaattagaat tggcagagaa cagggaaatt 1380 

ctaagagaac cagtacatgg agtatattat gacccatcaa aagacttggt agctgaaata 1440 

cagaaacagg ggcatgacca atggacatat caaatttacc aagaaccatt caaaaacctg 1500 

aaaacaggga agtatgcaaa aatgaggact gcccacacta atgatgtaaa acagttaaca 1560 

gaggcagtgc aaaaaatagc tatggaaagc atagtaatat ggggaaagac tcctaaattt 1620 
agactaccca tccaaaaaga aacatgggag acatggtgga cagactattg gcaagccacc 1680

tggattcctg agtgggagtt tgttaatacc cctcccttag taaaattatg gtaccagcta 1740

gagaaagaac ccataatagg agcagaaact ttctatgtag atggagcagc taatagggaa 1800

actaaaatag gaaaagcagg gtatgttact gacagaggaa ggcagaaaat tgtttctcta 1860

acagaaacaa caaatcagaa gactgaatta caagcaattc agctagcttt gcaagattca 1920

ggatcagaag taaacatagt aacagactca cagtatgcat taggaatcat tcaagcacaa 1980

ccagataaga gtgaatcaga gttagtcaac caaataatag aacaattaat aaaaaaggaa 2040

aaggtctacc tgtcatgggt accagcacat aaaggaattg gaggaaatga acaaatagat 2100

aaattagtaa gtaagggaat caggaaagtg ctgtttctag atggaataga t 2151



<210> 36 
<211> 54 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 36 

ggcggcatcg tgatctacca gtacatggac gacctgtacg tgggcagcgg cggc 54 



<210> 37 
<211> 18 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Gly Gly Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser
1               5                   10                  15



Gly Gly 



<210> 38 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S1FCSacTA

<400> 38 

gtttcttgag ctctggaagg gttaatttac tccaagaa 38 



<210> 39 
<211> 38 
<212> DNA

<213> Artificial 

<220> 

<223> primer S1FTSacTA

<400> 39 

gtttcttgag ctctggaagg gttaatttac tctaagaa 38

<210> 40 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S145RTSalTA 

<400> 40 

gtttcttgtc gacttgtcca tgtatggctt cccct 35



<210> 41 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S145RCSalTA 

<400> 41 

gtttcttgtc gacttgtcca tgcatggctt ccct 34



<210> 42 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S245FASalTA 

<400> 42 

gtttcttgtc gactgtagtc caggaatatg gcaattag 38



<210> 43 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S245FGSalTA 

<400> 43 

gtttcttgtc gactgtagtc cagggatatg gcaattag 38
<210> 44

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> primer S2FullNotTA 
<400> 44 

gtttcttgcg gccgctgcta gagattttcc acactacca 39 

<210> 45 
<211> 9738 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 45 

tggaagggtt aatttactcc aggaaaaggc aagagatcct tgatttatgg gtctatcaca 60 

cacaaggcta cttccctgat tggcaaaact acacaccggg accaggggtc agatatccac 120 

tgacctttgg atggtgcttc aagctagtgc cagttgaccc aagggaagta gaagaggcca 180 

acggaggaga agacaactgt ttgctacacc ctatgagcca gtatggaatg gatgatgaac 240 

acaaagaagt gttacagtgg aagtttgaca gcagcctagc acgcagacac ctggcccgcg 300 

agctacatcc ggattattac aaagactgct gacacagaag ggactttccg cctgggactt 360 

tccactgggg cgttccaggg ggagtggtct gggcgggact gggagtggcc agccctcaga 420 

tgctgcatat aagcagcggc ttttcgcctg tactgggtct ctctaggtag accagatccg 480 

agcctgggag ctctctgtct atctggggaa cccactgctt aggcctcaat aaagcttgcc 540 

ttgagtgctc taagtagtgt gtgcccatct gttgtgtgac tctggtaact ctggtaacta 600 

gagatccctc agaccctttg tggtagtgtg gaaaatctct agcagtggcg cccgaacagg 660 

gacttgaaag cgaaagtgag accagagaag atctctcgac gcaggactcg gcttgctgaa 720 

gtgcactcgg caagaggcga ggggggcgac tggtgagtac gccaaaattt tttttgacta 780 

gcggaggcta gaaggagaga gatgggtgcg agagcgtcaa tattaagagg gggaaaatta 840 

gacaaatggg aaaaaattag gttacggcca ggggggagaa aacactatat gctaaaacac 900 

ctagtatggg caagcagaga gctggaaaga tttgcagtta accctggcct tttagagaca 960 

tcagacggat gtagacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag 1020 

gaaattagat cattatttaa cacagtagca actctctatt gtgtacataa agggatagat 1080 

gtacgagaca ccaaggaagc cttagacaag atagaggagg aacaaaacaa atgtcagcaa 1140 
aaaacacagc aggcggaagc ggctgacaaa aaggtcagtc aaaattatcc tatagtgcag 1200

aacctccaag ggcaaatggt acaccaggcc atatcaccta gaaccttgaa tgcatgggta 1260 

aaagtaatag aggagaaggc ttttagccca gaggtaatac ccatgtttac agcattatca 1320 

gaaggagcca ccccacaaga tttaaacacc atgttaaata cagtgggggg acatcaagca 1380 

gccatgcaaa tgttaaaaga taccatcaat gaggaggctg cagaatggga taggttacat 1440 

ccagtacatg cagggcctgt tgcaccaggc cagatgagag aaccaagggg aagtgacata 1500 

gcaggaacta ctagtaccct tcaagaacaa atagcatgga tgacaagtaa cccacctatc 1560 

ccagtagggg acatctataa aaggtggata attctggggt taaataaaat agtaagaatg 1620 

tacagccctg tcagcatttt agacataaaa caaggaccaa aggaaccctt tagagactat 1680 

gtagaccggt tcttcaaaac tttaagagct gaacaatcta cacaagaggt aaaaaattgg 1740 

atgacagaca ccttgttagt ccaaaatgcg aacccagatt gtaagaccat tttaagagca 1800 

ttaggaccag gggcttcatt agaagaaatg atgacagcat gtcagggagt gggaggacct 1860 

agccacaaag caagagtttt ggctgaggca atgagccaag caaacaatac aagtgtaatg 1920 

atacagaaaa gcaattttaa aggccctaga agagctgtta aatgtttcaa ctgtggcagg 1980 

gaagggcaca tagccaggaa ttgcagggcc cctaggaaaa ggggctgttg gaaatgtgga 2040 

aaggaaggac accaaatgaa agactgtact gagaggcagg ctaatttttt agggaaaatt 2100 

tggccttccc acaaggggag gccagggaat ttccttcaga gcagaccaga gccaacagcc 2160 

ccaccactag aaccaacagc cccaccagca gagagcttca agttcaagga gactccgaag 2220 

caggagccga aagacaggga acctttaact tccctcaaat cactctttgg cagcgacccc 2280 

ttgtctcaat aaaagtagcg ggccaaacaa aggaggctct tttagataca ggagcagatg 2340 

atacagtact agaagaaata aacttgccag gaaaatggaa accaaaaatg ataggaggaa 2400 

ttggaggttt tatcaaagta agacagtatg atcaaatact tatagaaatt tgtggaaaaa 2460 

gggctatagg tacagtatta gtaggaccta cacctgtcaa cataattgga agaaatctgt 2520 

tgactcagct tggatgcaca ctaaattttc caattagccc cattgaaact gtaccagtaa 2580 

aattaaagcc aggaatggat ggcccaaagg ttaaacaatg gccattgaca gaagaaaaaa 2640

taaaagcatt aacagaaatt tgtgaggaaa tggagaagga aggaaaaatt acaaaaattg 2700 

ggcctgaaaa tccatataac actccagtat ttgccataaa gaagaaggac agtacaaagt 2760 

ggagaaaatt agtagatttc agggaactca ataaaagaac tcaagacttt tgggaagtcc 2820 

aattaggaat accacaccca gcagggttaa aaaagaaaaa atcagtgaca gtactggatg 2880 
tgggagatgc atatttttca gtccctttag atgagagctt cagaaaatat actgcattca 2940

ccatacctag tataaacaat gaaacaccag ggattagata tcaatataat gttcttccac 3000 

agggatggaa aggatcacca gcaatattcc agagtagcat gacaagaatc ttagagccct 3060 

ttagaacaca aaacccagaa gtagttatct atcaatatat ggatgactta tatgtaggat 3120 

ctgacttaga aatagggcaa catagagcaa aaatagagga gttaagagga cacctattga 3180 

aatggggatt taccacacca gacaagaaac atcagaaaga acccccattt ctttggatgg 3240 

ggtatgaact ccatcctgac aaatggacag tacagcctat acagctgcca gaaaaggaga 3300 

gctggactgt caatgatata cagaagttag tgggaaagtt aaactgggca agtcagattt 3360 

acccagggat taaagtaagg caactgtgta aactccttag gggagccaaa gcactaacag 3420 

acatagtgcc actgactgaa gaagcagaat tagaattggc tgagaacagg gaaattctaa 3480 

aagaaccagt acatggagta tattatgacc catcaaaaga tttaatagct gaaatacaga 3540 

aacaggggaa tgaccaatgg acatatcaaa tttaccaaga accatttaaa aatctgagaa 3600 

caggaaagta tgcaaaaatg aggactgccc acactaatga tgtgaaacag ttagcagagg 3660 

cagtgcaaaa gataacccag gaaagcatag taatatgggg aaaaactcct aaatttagac 3720 

tacccatccc aaaagaaaca tgggagacat ggtggtcaga ctattggcaa gccacctgga 3780 

ttcctgagtg ggagtttgtc aatacccctc ccctagtaaa attgtggtac cagctggaaa 3840 

aagaacccat agtaggggca gaaactttct atgtagatgg agcagccaat agggaaacta 3900 

aaataggaaa agcagggtat gtcactgaca aaggaaggca gaaagttgtt tccttcactg 3960 

aaacaacaaa tcagaagact gaattacaag caattcagct agctttgcag gattcagggc 4020 

cagaagtaaa catagtaaca gactcacagt atgcattagg aatcattcaa gcacaaccag 4080 

ataagagtga atcagaatta gtcagtcaaa taatagaaca gttgataaaa aaggaaaaag 4140 

tctacctatc atgggtacca gcacataaag gaattggagg aaatgaacaa gtagacaaat 4200 

tagtaagtag tggaatcaga aaagtactgt ttctagatgg aatagataaa gctcaagaag 4260 

agcatgaaaa atatcacagc aattggagag caatggctag tgagtttaat ctgccaccca 4320 

tagtagcaaa ggaaatagta gccagctgtg ataaatgtca gctaaaaggg gaagccatgc 4380 

atggacaagt cgactgtagt ccaggaatat ggcaattaga ctgtacacat ttagaaggaa 4440 

aaatcatcct agtagcagtc catgtagcca gtggctacat ggaagcagag gttatcccag 4500 

cagaaacagg acaagaaaca gcatacttta tactaaaatt agcaggaaga tggccagtca 4560 
aagtaataca tacagataat ggcagtaatt tcaccagtac cgcagttaag gcagcctgtt 4620

ggtgggcaga tatccaacgg gaatttggaa ttccctacaa tccccaaagt caaggagtag 4680 

tagaatccat gaataaagaa ttaaagaaaa tcatagggca agtaagagat caagctgagc 4740 

accttaagac agcagtacaa atggcagtat tcattcacaa ttttaaaaga aaagggggga 4800 

ttggggggta cagtgcaggg gagagaataa tagacataat agcatcagac atacaaacta 4860 

aagaattaca aaaacaaatt ataaaaattc aaaattttcg ggtttattac agagacagca 4920 

gagaccctat ttggaaagga ccagccaaac tactctggaa aggtgaaggg gcagtagtaa 4980 

tacaagataa tagtgatata aaggtagtac caagaaggaa agcaaaaatc attaaggact 5040 

atggaaaaca gatggcaggt gctgattgtg tggcaggtag acaggatgaa gattagaaca 5100 

tggcacagtt tagtaaagca ccatatgtat gtttcgagga gagctgatgg atggttctac 5160 

agacatcatt atgaaagcag acacccaaaa gtaagttcag aagtacacat cccattagga 5220 

gatgccaggt tagtaataaa aacatattgg ggtctgcaga caggagaaag agcttggcat 5280 

ttgggtcacg gagtctccat agaatggaga ttgagaagat atagcacaca agtagaccct 5340 

gacctgacag accaactaat tcatatgcat tattttgatt gttttgcaga atctgccata 5400 

aggaaagcca tactaggaca gatagttagc cctaagtgtg actatcaagc aggacataac 5460 

aaggtaggat ctctacaata cttggcactg acagcattga taaaaccaaa aaagataaag 5520 

ccacctctgc ctagtgttag gaaattagta gaggatagat ggaacaagcc ccagaagacc 5580 

aggggccgca gagggaacca tacaatgaat ggacactaga gcttttagaa gaactcaagc 5640 

aggaagctgt cagacacttt cctagaccat ggctccataa cttaggacaa catatctatg 5700 

aaacctatgg agatacttgg acaggagttg aagcaataat aagaatcctg caacaattac 5760 

tgtttattca tttcaggatt gggtgccatc atagcagaat aggcattttg cgacagagaa 5820 

gagcaagaaa tggagccaat agatcctaac ctagaaccct ggaaccatcc aggaagtcag 5880 

cctaaaactg cttgtaatgg gtgttactgt aaacgttgca gctatcattg tctagtttgc 5940 

tttcagaaaa aaggcttagg catttactat ggcaggaaga agcggagaca gcgacgaagc 6000 

gctcctccaa gcaataaaga tcatcaagat cctctaccaa agcagtaagt accgaatagt 6060 

atatgtaatg ttagatttaa ctgcaagaat agattctaga ttaggaatag gagcattgat 6120 

agtagcacta atcatagcaa taatagtgtg gaccatagta tatatagaat ataggaaatt 6180 

ggtaaggcaa aggaaaatag actggttagt taaaaggatt agggaaagag cagaagacag 6240 

tggcaatgag agcgaggggg atactgaaga attatcgaca ctggtggata tggggcatct 6300 
taggcttttg gatgctaatg atgtgtaatg tgaagggctt gtgggtcaca gtctactacg 6360

gggtacctgt ggggagagaa gcaaaaacta ctctattttg tgcatcagat gctaaagcat 6420 

atgagaaaga agtgcataat gtctgggcta cacatgcctg tgtacccaca gaccccaacc 6480 

cacaagaagt gattttgggc aatgtaacag aaaattttaa catgtggaaa aatgacatgg 6540 

tggatcagat gcaggaagat ataatcagtt tatgggatca aagccttaag ccatgtgtaa 6600 

aattgacccc actctgtgtc actttaaact gtacaaatgc aactgttaac tacaataata 6660 

cctctaaaga catgaaaaat tgctctttct atgtaaccac agaattaaga gataagaaaa 6720 

agaaagaaaa tgcacttttt tatagacttg atatagtacc acttaataat aggaagaatg 6780 

ggaatattaa caactataga ttaataaatt gtaatacctc agccataaca caagcctgtc 6840 

caaaagtctc gtttgaccca attcctatac attattgtgc tccagctggt tatgcgcctc 6900 

taaaatgtaa taataagaaa ttcaatggaa taggaccatg cgataatgtc agcacagtac 6960 

aatgtacaca tggaattaag ccagtggtat caactcaatt actgttaaat ggtagcctag 7020 

cagaagaaga gataataatt agatctgaaa atctgacaaa caatgtcaaa acaataatag 7080 

tacatcttaa tgaatctata gagattaaat gtacaagacc tggcaataat acaagaaaga 7140 

gtgtgagaat aggaccagga caagcattct atgcaacagg agacataata ggagatataa 7200 

gacaagcaca ttgtaacatt agtaaaaatg aatggaatac aactttacaa agggtaagtc 7260 

aaaaattaca agaactcttc cctaatagta cagggataaa atttgcacca cactcaggag 7320 

gggacctaga aattactaca catagcttta attgtggagg agaatttttc tattgcaata 7380 

caacagacct gtttaatagt acatacagta atggtacatg cactaatggt acatgcatgt 7440 

ctaataatac agagcgcatc acactccaat gcagaataaa acaaattata aacatgtggc 7500 

aggaggtagg acgagcaatg tatgcccctc ccattgcagg aaacataaca tgtagatcaa 7560 

atattacagg actactatta acacgtgatg gaggagataa taatactgaa acagagacat 7620 

tcagacctgg aggaggagac atgagggaca attggagaag tgaattatat aaatacaagg 7680

tggtagaaat taaaccatta ggagtagcac ccactgctgc aaaaaggaga gtggtggaga 7740 

gagaaaaaag agcagtagga ataggagctg tgttccttgg gttcttggga gcagcaggaa 7800 

gcactatggg cgcagcatca ataacgctga cggtacaggc cagacaatta ttgtctggta 7860 

tagtgcaaca gcaaagtaat ttgctgaggg ctatagaggc gcaacagcat atgttgcaac 7920 

tcacggtctg gggcattaag cagctccagg caagagtcct ggctatagag agatacctac 7980 
aggatcaaca gctcctagga ctgtggggct gctctggaaa actcatctgc accactaatg 8040

tgctttggaa ctctagttgg agtaataaaa ctcaaagtga tatttgggat aacatgacct 8100 

ggatgcagtg ggatagggaa attagtaatt acacaaacac aatatacagg ttgcttgaag 8160 

actcgcaaag ccagcaggaa agaaatgaaa aagatttact agcattggac aggtggaaca 8220 

atctgtggaa ttggtttagc ataacaaatt ggctgtggta tataaaaata ttcataatga 8280 

tagtaggagg cttgataggt ttaagaataa tttttgctgt gctctctcta gtaaatagag 8340 

ttaggcaggg atactcaccc ttgtcattgc agacccttat cccaaacccg aggggacccg 8400 

acaggctcgg aggaatcgaa gaagaaggtg gagagcaaga cagcagcaga tccattcgat 8460 

tagtgagcgg attcttgaca cttgcctggg acgacctacg aagcctgtgc ctcttctgct 8520 

accaccgatt gagagacttc atattaattg tagtgagagc agtggaactt ctgggacaca 8580 

gtagtctcag gggactgcag agggggtggg gaacccttaa gtatttgggg agtcttgtgc 8640 

aatattgggg tctagagtta aaaaagagtg ctattaatct gcttgatact atagcaatag 8700 

cagtagctga aggaacagat aggattctag aattcataca aaacctttgt agaggtatcc 8760 

gcaacgtacc tagaagaata agacagggct tcgaagcagc tttgcaataa aatggggggc 8820 

aagtggtcaa aaagcagtat aattggatgg cctgaagtaa gagaaagaat cagacgaact 8880 

aggtcagcag cagagggagt aggatcagcg tctcaagact tagagaaaca tggggcactt 8940 

acaaccagca acacagccca caacaatgct gcttgcgcct ggctggaagc gcaagaggag 9000 

gaaggagaag taggctttcc agtcagacct caggtacctt taagaccaat gacttataaa 9060 

gcagcaatag atctcagctt ctttttaaaa gaaaaggggg gactggaagg gttaatttac 9120 

tccaagaaaa ggcaagagat ccttgatttg tgggtttata acacacaagg cttcttccct 9180 

gattggcaaa actacacacc gggaccaggg gtcagatttc cactgacctt tggatggtac 9240 

ttcaagctag agccagtcga tccaagggaa gtagaagagg ccaatgaagg agaaaacaac 9300 

tgtttactac accctatgag ccagcatgga atggaggatg aagacagaga agtattaaga 9360 

tggaagtttg acagtacgct agcacgcaga cacatggccc gcgagctaca tccggagtat 9420 

tacaaagact gctgacacag aagggacttt ccgctgggac tttccactgg ggcgttccag 9480 

gaggtgtggt ctgggcggga caggggagtg gtcagccctg agatgctgca tataagcagc 9540 

tgcttttcgc ctgtactggg tctctctagg tagaccagat ctgagcccgg gagctctctg 9600 

gctatctagg gaacccactg cttaagcctc aataaagctt gccttgagtg ccttgagtag 9660 

tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagacca cttgtggtag 9720 
tgtggaaaat ctctagca 9738



<210> 46 

<211> 97 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Env Optimized common region short 
<400> 46 

catcaccctg cagtgcaaga tcaagcagat cgtgcgcatg tggcagggcg tgggccaggc 60
catgtacgcc ccccccatcg ccggcaacat cacctgc 97



<210> 47 

<211> 144 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Env Optimized common region 
<400> 47 

ctgcccatca ccctgcagtg caagatcaag cagatcgtgc gcatgtggca gggcgtgggc 60
caggccatgt acgccccccc catcgccggc aacatcacct gccgcagcaa catcaccggc 120
atcctgctga cccgcgacgg cggc 144



<210> 48 

<211> 144 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Env wild type common region 
<400> 48 

ttacccatca cactccaatg caaaataaaa caaattgtac gcatgtggca aggggtagga 60
caagcaatgt atgcccctcc cattgcagga aacataacat gtagatcaaa catcacagga 120
atactattga cacgtgatgg ggga 144



<210> 49 

<211> 2610 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Envgp160 optimized
<400> 49

atgcgcgtga tgggcaccca gaagaactgc cagcagtggt ggatctgggg catcctgggc 60 

ttctggatgc tgatgatctg caacaccgag gacctgtggg tgaccgtgta ctacggcgtg 120

cccgtgtggc gcgaggccaa gaccaccctg ttctgcgcca gcgacgccaa ggcctacgag 180 

accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 240 

gagatcgtgc tgggcaacgt gaccgagaac ttcaacatgt ggaagaacaa catggccgac 300 

cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 360 

acccccctgt gcgtgaccct gaactgcacc gacaccaacg tgaccggcaa ccgcaccgtg 420 

accggcaaca ccaacgacac caacatcgcc aacgccacct acaagtacga ggagatgaag 480 

aactgcagct tcaacgccac caccgagctg cgcgacaaga agcacaagga gtacgccctg 540 

ttctacaagc tggacatcgt gcccctgaac gagaacagca acaacttcac ctaccgcctg 600 

atcaactgca acaccagcac catcacccag gcctgcccca aggtgagctt cgaccccatc 660 

cccatccact actgcgcccc cgccgactac gccatcctga agtgcaacaa caagaccttc 720 

aacggcaccg gcccctgcta caacgtgagc accgtgcagt gcacccacgg catcaagccc 780 

gtggtgagca cccagctgct gctgaacggc agcctggccg aggagggcat catcatccgc 840 

agcgagaacc tgaccgagaa caccaagacc atcatcgtgc acctgaacga gagcgtggag 900 

atcaactgca cccgccccaa caacaacacc cgcaagagcg tgcgcatcgg ccccggccag 960 

gccttctacg ccaccaacga cgtgatcggc aacatccgcc aggcccactg caacatcagc 1020 

accgaccgct ggaacaagac cctgcagcag gtgatgaaga agctgggcga gcacttcccc 1080 

aacaagacca tcaagttcga gccccacgcc ggcggcgacc tggagatcac catgcacagc 1140 

ttcaactgcc gcggcgagtt cttctactgc aacaccagca acctgttcaa cagcacctac 1200 

taccccaaga acggcaccta caagtacaac ggcaacagca gcctgcccat caccctgcag 1260 

tgcaagatca agcagatcgt gcgcatgtgg cagggcgtgg gccaggccat gtacgccccc 1320 

cccatcgccg gcaacatcac ctgccgcagc aacatcaccg gcatcctgct gacccgcgac 1380 

ggcggcttca acaacaccaa caacgacacc gaggagacct tccgccccgg cggcggcgac 1440 

atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 

ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcaagaagcg cgccgtgggc 1560 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 

atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740

cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 

atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 

agcaacaaga gcgaggccga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 

atcaacaact acaccgagac catcttccgc ctgctggagg acagccagaa ccagcaggag 1980 

aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 

atcagcaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100 

ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160 

ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220 

gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280 

ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2340 

atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400 

cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460 

aagaagagcg ccatcagccc cctggacacc atcgccatcg ccgtggccga gggcaccgac 2520 

cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580 

cgccagggct tcgaggccgc cctgctgtaa 2610 

<210> 50 
<211> 2610 
<212> DNA 
<213> Artificial 

<220> 

<223> Type C Envgp160 wild type
<400> 50 

atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 60 

ttctggatgc taatgatttg taacacggag gacttgtggg tcacagtcta ctatggggta 120 

cctgtgtgga gagaagcaaa aactactcta ttctgtgcat cagatgctaa agcatatgag 180 

acagaagtgc ataatgtctg ggctacacat gcttgtgtac ccacagaccc caacccacaa 240 

gaaatagttt tgggaaatgt aacagaaaat tttaatatgt ggaaaaataa catggcagat 300 

cagatgcatg aggatataat cagtttatgg gatcaaagcc taaagccatg tgtaaagttg 360 

accccactct gtgtcacttt aaactgtaca gatacaaatg ttacaggtaa tagaactgtt 420 
acaggtaata caaatgatac caatattgca aatgctacat ataagtatga agaaatgaaa 480

aattgctctt tcaatgcaac cacagaatta agagataaga aacataaaga gtatgcactc 540 

ttttataaac ttgatatagt accacttaat gaaaatagta acaactttac atatagatta 600 

ataaattgca atacctcaac cataacacaa gcctgtccaa aggtctcttt tgacccgatt 660 

cctatacatt actgtgctcc agctgattat gcgattctaa agtgtaataa taagacattc 720 

aatgggacag gaccatgtta taatgtcagc acagtacaat gtacacatgg aattaagcca 780 

gtggtatcaa ctcaactact gttaaatggt agtctagcag aagaagggat aataattaga 840 

tctgaaaatt tgacagagaa taccaaaaca ataatagtac atcttaatga atctgtagag 900 

attaattgta caaggcccaa caataataca aggaaaagtg taaggatagg accaggacaa 960 

gcattctatg caacaaatga cgtaatagga aacataagac aagcacattg taacattagt 1020 

acagatagat ggaataaaac tttacaacag gtaatgaaaa aattaggaga gcatttccct 1080 

aataaaacaa taaaatttga accacatgca ggaggggatc tagaaattac aatgcatagc 1140 

tttaattgta gaggagaatt tttctattgc aatacatcaa acctgtttaa tagtacatac 1200 

taccctaaga atggtacata caaatacaat ggtaattcaa gcttacccat cacactccaa 1260 

tgcaaaataa aacaaattgt acgcatgtgg caaggggtag gacaagcaat gtatgcccct 1320 

cccattgcag gaaacataac atgtagatca aacatcacag gaatactatt gacacgtgat 1380 

gggggattta acaacacaaa caacgacaca gaggagacat tcagacctgg aggaggagat 1440 

atgagggata actggagaag tgaattatat aaatataaag tggtagaaat taagccattg 1500 

ggaatagcac ccactaaggc aaaaagaaga gtggtgcaga gaaaaaaaag agcagtggga 1560 

ataggagctg tgttccttgg gttcttggga gcagcaggaa gcactatggg cgcagcgtca 1620 

ataacgctga cggtacaggc cagacaactg ttgtctggta tagtgcaaca gcaaagcaat 1680 

ttgctgaagg ctatagaggc gcaacagcat atgttgcaac tcacagtctg gggcattaag 1740 

cagctccagg cgagagtcct ggctatagaa agatacctaa aggatcaaca gctcctaggg 1800 

atttggggct gctctggaag actcatctgc accactgctg tgccttggaa ctccagttgg 1860 

agtaataaat ctgaagcaga tatttgggat aacatgactt ggatgcagtg ggatagagaa 1920 

attaataatt acacagaaac aatattcagg ttgcttgaag actcgcaaaa ccagcaggaa 1980 

aagaatgaaa aagatttatt agaattggac aagtggaata atctgtggaa ttggtttgac 2040 

atatcaaact ggctgtggta tataaaaata ttcataatga tagtaggagg cttgataggt 2100 

ttaagaataa tttttgctgt gctctctata gtgaatagag ttaggcaggg atactcacct 2160 
ttgtcatttc agacccttac cccaagcccg aggggactcg acaggctcgg aggaatcgaa 2220

gaagaaggtg gagagcaaga cagagacaga tccatacgat tggtgagcgg attcttgtcg 2280 

cttgcctggg acgatctgcg gagcctgtgc ctcttcagct accaccgctt gagagacttc 2340 

atattaattg cagtgagggc agtggaactt ctgggacaca gcagtctcag gggactacag 2400 

agggggtggg agatccttaa gtatctggga agtcttgtgc agtattgggg tctagagcta 2460 

aaaaagagtg ctattagtcc gcttgatacc atagcaatag cagtagctga aggaacagat 2520 

aggattatag aattggtaca aagaatttgt agagctatcc tcaacatacc taggagaata 2580 

agacagggct ttgaagcagc tttgctataa 2610 

<210> 51 
<211> 1494 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Gag optimized 
<400> 51 

atgggcgccc gcgccagcat cctgagcggc ggcaagctgg acaagtggga gcgcatccgc 60 

ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 

ctggagcgct tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 

atcaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgttcaac 240 

accgtggcca ccctgtactg cgtgcacaag ggcatcgagg tgcgcgacac caaggaggcc 300 

ctggacaaga tcgaggagga gcagaacaag tgccagcaga aggcccagca ggccaaggcc 360 

gccgacgaga aggtgagcca gaactacccc atcgtgcaga acgcccaggg ccagatggtg 420 

caccaggcca tcagcccccg caccctgaac gcctggatca aggtgatcga ggagaaggcc 480 

ttcaaccccg aggagatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 

ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 

accatcaacg aggaggccgc cgagtgggac cgcacccacc ccgtgcacgc cggccccgtg 660 

gcccccggcc agatgcgcga gccccgcggc agcgacatcg ccggcaccac cagcaccctg 720 

caggagcaga tcgcctggat gaccagcaac ccccccatcc ccgtggagga catctacaag 780 

cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgacac cctgctggtg 960

cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccagcctg 1020 

gaggagatga tgaccgcctg ccagggcgtg ggcggcccca gccacaaggc ccgcgtgctg 1080 

gccgaggcca tgagccaggc caacagcaac atcctggtgc agcgcagcaa cttcaagggc 1140 

agcaaccgca tcatcaagtg cttcaactgc ggcaaggtgg gccacatcgc ccgcaactgc 1200 

cgcgcccccc gcaagaaggg ctgctggaag tgcggccagg agggccacca gatgaaggac 1260 

tgcaccgagc gccaggccaa cttcctgggc aagatctggc ccagccacaa gggccgcccc 1320 

ggcaacttcc tgcagaaccg ccccgagccc accgcccccc ccgccgagcc caccgccccc 1380 

cccgccgaga gcttccgctt cgaggagacc acccccgtgc cccgcaagga gaaggagcgc 1440 

gagcccctga ccagcctgaa gagcctgttc ggcagcgacc ccctgagcca gtaa 1494 

<210> 52 
<211> 1494 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Gag Wild Type 
<400> 52 

atgggtgcga gagcgtcaat attaagcggc ggaaaattag ataaatggga aagaattagg 60 

ttaaggccag ggggaaagaa acattatatg ttaaaacatc tagtatgggc aagcagggag 120

ctggaaagat ttgcacttaa ccctggcctg ttagaaacat cagaaggctg taaacaaata 180 

ataaaacagc tacaaccagc tcttcagaca ggaacagagg aacttagatc attattcaac 240 

acagtagcaa ctctctattg tgtacataaa gggatagagg tacgagacac caaggaagcc 300 

ttagacaaga tagaggaaga acaaaacaaa tgtcagcaaa aagcacaaca ggcaaaagca 360 

gctgacgaaa aggtcagtca aaattatcct atagtacaga atgcccaagg gcaaatggta 420 

caccaagcta tatcacctag aacattgaat gcatggataa aagtaataga ggaaaaggct 480 

ttcaatccag aggaaatacc catgtttaca gcattatcag aaggagccac cccacaagat 540 

ttaaacacaa tgttaaatac agtgggggga catcaagcag ccatgcaaat gttaaaagat 600 

accatcaatg aggaggctgc agaatgggat aggacacatc cagtacatgc agggcctgtt 660 

gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 720 

caggaacaaa tagcatggat gacaagtaat ccacctattc cagtagaaga catctataaa 780 

agatggataa ttctggggtt aaataaaata gtaagaatgt atagccctgt tagcattttg 840 
gacataaaac aagggccaaa agaacccttt agagactatg tagaccggtt ctttaaaacc 900

ttaagagctg aacaagctac acaagatgta aagaattgga tgacagacac cttgttggtc 960 

caaaatgcga acccagattg taagaccatt ttaagagcat taggaccagg ggcctcatta 1020 

gaagaaatga tgacagcatg tcagggagtg ggaggaccta gccataaagc aagagtgttg 1080 

gctgaggcaa tgagccaagc aaacagtaac atactagtgc agagaagcaa ttttaaaggc 1140 

tctaacagaa ttattaaatg tttcaactgt ggcaaagtag ggcacatagc cagaaattgc 1200 

agggccccta ggaaaaaggg ctgttggaaa tgtggacagg aaggacacca aatgaaagac 1260 

tgtactgaga ggcaggctaa ttttttaggg aaaatttggc cttcccacaa ggggaggcca 1320 

gggaatttcc tccagaacag accagagcca acagccccac cagcagaacc aacagcccca 1380 

ccagcagaga gcttcaggtt cgaggagaca acccccgtgc cgaggaagga gaaagagagg 1440 

gaacctttaa cttccctcaa atcactcttt ggcagcgacc ccttgtctca ataa 1494 



<210> 53 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Gag Major Homology Region Optimized 

<400> 53 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 



<210> 54 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Gag Major Homology Region Wild Type 

<400> 54 

gacataaaac aagggccaaa agaacccttt agagactatg tagaccggtt ctttaaaacc 60 



<210> 55 

<211> 624 

<212> DNA 

<213> Artificial 



<220> 

<223> HIV Type C Nef Optimized 
<400> 55 
atgggcggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60

cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120 

ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180 

caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg actggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480 

gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540 

gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 

<210> 56 
<211> 624 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Nef Wild Type 
<400> 56 

atgggaggca agtggtcaaa acgcagcata gttggatggc ctgcagtaag agaaagaatg 60 

agaagaactg agccagcagc agagggagta ggagcagcgt ctcaagactt agatagacat 120 

ggggcactta caagcagcaa cacacctgct actaatgaag cttgtgcctg gctgcaagca 180 

caagaggagg acggagatgt aggctttcca gtcagacctc aggtaccttt aagaccaatg 240 

acttataaga gtgcagtaga tctcagcttc tttttaaaag aaaagggggg actggaaggg 300 

ttaatttact ctaggaaaag gcaagaaatc cttgatttgt gggtctataa cacacaaggc 360 

ttcttccctg attggcaaaa ctacacatcg gggccagggg tccgattccc actgaccttt 420 

ggatggtgct tcaagctagt accagttgac ccaagggagg tgaaagaggc caatgaagga 480 

gaagacaact gtttgctaca ccctatgagc caacatggag cagaggatga agatagagaa 540 

gtattaaagt ggaagtttga cagccttcta gcacacagac acatggcccg cgagctacat 600 

ccggagtatt acaaagactg ctga 624 



<210> 57 
<211> 624 
<212> DNA

<213> Artificial 

<220> 

<223> HIV Type C NefD125G Optimized 
<400> 57 

atgggcggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60 

cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120 

ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180 

caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg gctggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480 

gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540 

gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 

<210> 58 

<211> 354 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C p15RNaseH Optimized
<400> 58 

accttctacg tggacggcgc caccaaccgc gaggccaaga tcggcaaggc cggctacgtg 60 

accgaccgcg gccgccagaa gatcgtgacc ctgaccaaca ccaccaacca gaagaccgag 120 

ctgcaggcca tccagctggc cctgcaggac agcggcagcg aggtgaacat cgtgaccgac 180 

agccagtacg ccctgggcat catccaggcc cagcccgaca agagcgacag cgagatcttc 240 

aaccagatca tcgagcagct gatcaacaag gagcgcatct acctgagctg ggtgcccgcc 300 

cacaagggca tcggcggcaa cgagcaggtg gacaagctgg tgagcaaggg catc 354 



<210> 59 

<211> 354 

<212> DNA 

<213> Artificial 
<220>

<223> HIV Type C p15RNaseH Wild Type
<400> 59 

actttctatg tagatggagc aactaatagg gaagctaaaa taggaaaagc agggtatgtt 60 

actgacagag gaaggcagaa aattgttact ctaactaaca caacaaatca gaagactgag 120 

ttacaagcaa ttcagctagc tctgcaggat tcaggatcag aagtaaacat agtaacagac 180 

tcacagtatg cattaggaat cattcaagca caaccagata agagtgactc agagatattt 240 

aaccaaataa tagaacagtt aataaacaag gaaagaatct acctgtcatg ggtaccagca 300 

cataaaggaa ttgggggaaa tgaacaagta gataaattag taagtaaggg aatt 354 

<210> 60 
<211> 876 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C p31Int Optimized 
<400> 60 

cgcaaggtgc tgttcctgga cggcatcgac aaggcccagg aggagcacga gcgctaccac 60 

agcaactggc gcgccatggc caacgagttc aacctgcccc ccatcgtggc caaggagatc 120 

gtggccagct gcgacaagtg ccagctgaag ggcgaggcca tccacggcca ggtggactgc 180 

agccccggca tctggcagct ggactgcacc cacctggagg gcaagatcat cctggtggcc 240 

gtgcacgtgg ccagcggcta catggaggcc gaggtgatcc ccgccgagac cggccaggag 300 

accgcctact tcatcctgaa gctggccggc cgctggcccg tgaaggtgat ccacaccgac 360 

aacggcagca acttcaccag caccgccgtg aaggccgcct gctggtgggc cggcatccag 420 

caggagttcg gcatccccta caacccccag agccagggcg tggtggagag catgaacaag 480 

gagctgaaga agatcatcgg ccaggtgcgc gaccaggccg agcacctgaa gaccgccgtg 540 

cagatggccg tgttcatcca caacttcaag cgcaagggcg gcatcggcgg ctacagcgcc 600 

ggcgagcgca tcatcgacat catcgccacc gacatccaga ccaaggagct gcagaagcag 660 

atcatccgca tccagaactt ccgcgtgtac taccgcgaca gccgcgaccc catctggaag 720 

ggccccgccg agctgctgtg gaagggcgag ggcgtggtgg tgatcgagga caagggcgac 780 

atcaaggtgg tgccccgccg caaggccaag atcatccgcg actacggcaa gcagatggcc 840 

ggcgccgact gcgtggccgg cggccaggac gaggac 876 
<210> 61

<211> 876 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C p31Int Wild Type 
<400> 61 

aggaaagtgt tgtttctaga tggaatagat aaagctcaag aagagcatga aaggtaccac 60 

agcaattgga gagcaatggc taatgagttt aatctgccac ccatagtagc aaaagaaata 120 

gtagctagct gtgataaatg tcagctaaaa ggggaagcca tacatggaca agtcgactgt 180 

agtccaggga tatggcaatt agattgtacc catttagagg gaaaaatcat cctggtagca 240 

gtccatgtag ctagtggcta catggaagca gaggttatcc cagcagaaac aggacaagaa 300 

acagcatatt ttatattaaa attagcagga agatggccag tcaaagtaat acatacagac 360

aatggcagta attttaccag tactgcagtt aaggcagcct gttggtgggc aggtatccaa 420 

caggaatttg gaattcccta caatccccaa agtcagggag tggtagaatc catgaataaa 480 

gaattaaaga aaataatagg acaagtaaga gatcaagctg agcaccttaa gacagcagta 540 

caaatggcag tattcattca caattttaaa agaaaagggg gaattggggg gtacagtgca 600 

ggggaaagaa taatagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa 660 

attataagaa ttcaaaattt tcgggtttat tacagagaca gcagagaccc tatttggaaa 720 

ggaccagccg aactactctg gaaaggtgaa ggggtagtag taatagaaga taaaggtgac 780 

ataaaggtag taccaaggag gaaagcaaaa atcattagag attatggaaa acagatggca 840 

ggtgctgatt gtgtggcagg tggacaggat gaagat 876 

<210> 62 

<211> 3015 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Pol Optimized 
<400> 62 

ttcttccgcg agaacctggc cttcccccag ggcgaggccc gcgagttccc ccccgagcag 60 
acccgcgcca acagccccac cagccgcacc aacagcccca ccagccgcga gctgcaggtg 120 
cgcggcgaca acccccgcgc cgaggagggc gagcgcgagg gcaccttcaa cttcccccag 180 
atcaccctgt ggcagcgccc cctggtgagc atcaaggtgg agggccagat caaggaggcc 240 
ctgctggaca ccggcgccga cgacaccgtg ctggaggaga tcgacctgcc cggcaagtgg 300

aagcccaaga tgatcggcgg catcggcggc ttcatcaagg tgcgccagta cgaccagatc 360 

ctgatcgaga tctgcggcaa gaaggccatc ggcaccgtgc tggtgggccc cacccccgtg 420 

aacatcatcg gccgcaacct gctgacccag ctgggctgca ccctgaactt ccccatcagc 480 

cccatcgaga ccgtgcccgt gaagctgaag cccggcatgg acggccccaa ggtgaagcag 540 

tggcccctga ccgaggagaa gatcaaggcc ctgaccgcca tctgcgagga gatggagaag 600 

gagggcaaga tcaccaagat cggccccgac aacccctaca acacccccgt gttcgccatc 660 

aagaagaagg acagcaccaa gtggcgcaag ctggtggact tccgcgagct gaacaagcgc 720 

acccaggact tctgggaggt gcagctgggc atcccccacc ccgccggcct gaagaagaag 780 

aagagcgtga ccgtgctgga cgtgggcgac gcctacttca gcgtgcccct ggacgagagc 840 

ttccgcaagt acaccgcctt caccatcccc agcatcaaca acgagacccc cggcatccgc 900 

taccagtaca acgtgctgcc ccagggctgg aagggcagcc ccgccatctt ccagagcagc 960 

atgaccaaga tcctggagcc cttccgcgcc aagaaccccg acatcgtgat ctaccagtac 1020 

atggacgacc tgtacgtggg cagcgacctg gagatcggcc agcaccgcgc caagatcgag 1080 

gagctgcgcg agcacctgct gaagtggggc ttcaccaccc ccgacaagaa gcaccagaag 1140 

gagcccccct tcctgtggat gggctacgag ctgcaccccg acaagtggac cgtgcagccc 1200 

atcctgctgc ccgagaagga cagctggacc gtgaacgaca tccagaagct ggtgggcaag 1260 

ctgaactggg ccagccagat ctaccccggc atcaaggtgc gccagctgtg caagctgctg 1320 

cgcggcgcca aggccctgac cgacatcgtg cccctgaccg aggaggccga gctggagctg 1380 

gccgagaacc gcgagatcct gcgcgagccc gtgcacggcg tgtactacga ccccagcaag 1440 

gacctgatcg ccgagatcca gaagcagggc cacgagcagt ggacctacca gatctaccag 1500 

gagcccttca agaacctgaa gaccggcaag tacgccaaga tgcgcaccac ccacaccaac 1560 

gacgtgaagc agctgaccga ggccgtgcag aagatcgcca tggagagcat cgtgatctgg 1620 

ggcaagaccc ccaagttccg cctgcccatc cagaaggaga cctgggagac ctggtggacc 1680 

gactactggc aggccacctg gatccccgag tgggagttcg tgaacacccc ccccctggtg 1740 

aagctgtggt accagctgga gaaggacccc atcgccggcg tggagacctt ctacgtggac 1800 

ggcgccacca accgcgaggc caagatcggc aaggccggct acgtgaccga ccgcggccgc 1860 

cagaagatcg tgaccctgac caacaccacc aaccagaaga ccgagctgca ggccatccag 1920

ctggccctgc aggacagcgg cagcgaggtg aacatcgtga ccgacagcca gtacgccctg 1980 
ggcatcatcc aggcccagcc cgacaagagc gacagcgaga tcttcaacca gatcatcgag 2040

cagctgatca acaaggagcg catctacctg agctgggtgc ccgcccacaa gggcatcggc 2100 

ggcaacgagc aggtggacaa gctggtgagc aagggcatcc gcaaggtgct gttcctggac 2160 

ggcatcgaca aggcccagga ggagcacgag cgctaccaca gcaactggcg cgccatggcc 2220 

aacgagttca acctgccccc catcgtggcc aaggagatcg tggccagctg cgacaagtgc 2280 

cagctgaagg gcgaggccat ccacggccag gtggactgca gccccggcat ctggcagctg 2340 

gactgcaccc acctggaggg caagatcatc ctggtggccg tgcacgtggc cagcggctac 2400 

atggaggccg aggtgatccc cgccgagacc ggccaggaga ccgcctactt catcctgaag 2460 

ctggccggcc gctggcccgt gaaggtgatc cacaccgaca acggcagcaa cttcaccagc 2520

accgccgtga aggccgcctg ctggtgggcc ggcatccagc aggagttcgg catcccctac 2580 

aacccccaga gccagggcgt ggtggagagc atgaacaagg agctgaagaa gatcatcggc 2640 

caggtgcgcg accaggccga gcacctgaag accgccgtgc agatggccgt gttcatccac 2700 

aacttcaagc gcaagggcgg catcggcggc tacagcgccg gcgagcgcat catcgacatc 2760 

atcgccaccg acatccagac caaggagctg cagaagcaga tcatccgcat ccagaacttc 2820 

cgcgtgtact accgcgacag ccgcgacccc atctggaagg gccccgccga gctgctgtgg 2880 

aagggcgagg gcgtggtggt gatcgaggac aagggcgaca tcaaggtggt gccccgccgc 2940 

aaggccaaga tcatccgcga ctacggcaag cagatggccg gcgccgactg cgtggccggc 3000 

ggccaggacg aggac 3015 

<210> 63 
<211> 3015 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Pol Wild Type 
<400> 63 

ttttttaggg aaaatttggc cttcccacaa ggggaggcca gggaatttcc tccagaacag 60 

accagagcca acagccccac cagcagaacc aacagcccca ccagcagaga gcttcaggtt 120 

cgaggagaca acccccgtgc cgaggaagga gaaagagagg gaacctttaa cttccctcaa 180 

atcactcttt ggcagcgacc ccttgtctca ataaaagtag agggccagat aaaggaggct 240 

ctcttagaca caggagcaga tgatacagta ttagaagaaa tagatttgcc agggaaatgg 300 
aaaccaaaaa tgataggggg aattggaggt tttatcaaag taagacagta tgatcaaata 360

cttatagaaa tttgtggaaa aaaggctata ggtacagtat tagtagggcc tacaccagtc 420 

aacataattg gaagaaatct gttaactcag cttggatgca cactaaattt tccaattagt 480 

cctattgaaa ctgtaccagt aaaattaaaa ccaggaatgg atggcccaaa ggtcaaacaa 540 

tggccattga cagaagaaaa aataaaagca ttaacagcaa tttgtgagga aatggagaag 600 

gaaggaaaaa ttacaaaaat tgggcctgat aatccatata acactccagt atttgccata 660 

aaaaagaagg acagtactaa gtggagaaaa ttagtagatt tcagggaact caataaaaga 720 

actcaagact tttgggaagt tcaattagga ataccacacc cagcaggatt aaaaaagaaa 780 

aaatcagtga cagtgctaga tgtgggggat gcatattttt cagttccttt agatgaaagc 840 

ttcaggaaat atactgcatt caccatacct agtataaaca atgaaacacc agggattaga 900 

tatcaatata atgtgctgcc acagggatgg aaaggatcac cagcaatatt ccagagtagc 960 

atgacaaaaa tcttagagcc cttcagagca aaaaatccag acatagttat ctatcaatat 1020 

atggatgact tgtatgtagg atctgactta gaaatagggc aacatagagc aaaaatagaa 1080 

gagttaaggg aacatttatt gaaatgggga tttacaacac cagacaagaa acatcaaaaa 1140 

gaacccccat ttctttggat ggggtatgaa ctccatcctg acaaatggac agtacaacct 1200 

atactgctgc cagaaaagga tagttggact gtcaatgata tacagaagtt agtgggaaaa 1260 

ttaaactggg caagtcagat ttacccaggg attaaagtaa ggcaactctg taaactcctc 1320 

aggggggcca aagcactaac agacatagta ccactaactg aagaagcaga attagaattg 1380 

gcagagaaca gggaaatttt aagagaacca gtacatggag tatattatga tccatcaaaa 1440 

gacttgatag ctgaaataca gaaacagggg catgaacaat ggacatatca aatttatcaa 1500 

gaaccattta aaaatctgaa aacagggaag tatgcaaaaa tgaggactac ccacactaat 1560 

gatgtaaaac agttaacaga ggcagtgcaa aaaatagcca tggaaagcat agtaatatgg 1620

ggaaagactc ctaaatttag actacccatc caaaaagaaa catgggagac atggtggaca 1680 

gactattggc aagccacctg gatccctgag tgggagtttg ttaatacccc tcccctagta 1740 

aaattatggt accaactaga aaaagatccc atagcaggag tagaaacttt ctatgtagat 1800 

ggagcaacta atagggaagc taaaatagga aaagcagggt atgttactga cagaggaagg 1860 

cagaaaattg ttactctaac taacacaaca aatcagaaga ctgagttaca agcaattcag 1920 

ctagctctgc aggattcagg atcagaagta aacatagtaa cagactcaca gtatgcatta 1980 

ggaatcattc aagcacaacc agataagagt gactcagaga tatttaacca aataatagaa 2040 
cagttaataa acaaggaaag aatctacctg tcatgggtac cagcacataa aggaattggg 2100

ggaaatgaac aagtagataa attagtaagt aagggaatta ggaaagtgtt gtttctagat 2160

ggaatagata aagctcaaga agagcatgaa aggtaccaca gcaattggag agcaatggct 2220

aatgagttta atctgccacc catagtagca aaagaaatag tagctagctg tgataaatgt 2280

cagctaaaag gggaagccat acatggacaa gtcgactgta gtccagggat atggcaatta 2340

gattgtaccc atttagaggg aaaaatcatc ctggtagcag tccatgtagc tagtggctac 2400

atggaagcag aggttatccc agcagaaaca ggacaagaaa cagcatattt tatattaaaa 2460

ttagcaggaa gatggccagt caaagtaata catacagaca atggcagtaa ttttaccagt 2520

actgcagtta aggcagcctg ttggtgggca ggtatccaac aggaatttgg aattccctac 2580

aatccccaaa gtcagggagt ggtagaatcc atgaataaag aattaaagaa aataatagga 2640

caagtaagag atcaagctga gcaccttaag acagcagtac aaatggcagt attcattcac 2700

aattttaaaa gaaaaggggg aattgggggg tacagtgcag gggaaagaat aatagacata 2760

atagcaacag acatacaaac taaagaatta caaaaacaaa ttataagaat tcaaaatttt 2820

cgggtttatt acagagacag cagagaccct atttggaaag gaccagccga actactctgg 2880

aaaggtgaag gggtagtagt aatagaagat aaaggtgaca taaaggtagt accaaggagg 2940

aaagcaaaaa tcattagaga ttatggaaaa cagatggcag gtgctgattg tgtggcaggt 3000

ggacaggatg aagat 3015



<210> 64 

<211> 297 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Protease Optimized 
<400> 64 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 
gaggccctgc tggacaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 
aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 
cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttc 297 

<210> 65 
<211> 297
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Protease Wild Type 
<400> 65 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagacacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaatttt 297 



<210> 66 

<211> 297 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Inactivated Protease Optimized 
<400> 66 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 
gaggccctgc tggccaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 
aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 
cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttc 297 



<210> 67 

<211> 297 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Inactivated Protease Wild Type 
<400> 67 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagccacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaatttt 297 
<210> 68
<211> 1965 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Inactivated Protease Mutated Reverse Transcriptase 
Optimized 

<400> 68 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 

gaggccctgc tggccaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 

aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 

cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 

cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttcccc 300 

atcagcccca tcgagaccgt gcccgtgaag ctgaagcccg gcatggacgg ccccaaggtg 360 

aagcagtggc ccctgaccga ggagaagatc aaggccctga ccgccatctg cgaggagatg 420 

gagaaggagg gcaagatcac caagatcggc cccgacaacc cctacaacac ccccgtgttc 480 

gccatcaaga agaaggacag caccaagtgg cgcaagctgg tggacttccg cgagctgaac 540 

aagcgcaccc aggacttctg ggaggtgcag ctgggcatcc cccaccccgc cggcctgaag 600 

aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggac 660 

gagagcttcc gcaagtacac cgccttcacc atccccagca tcaacaacga gacccccggc 720 

atccgctacc agtacaacgt gctgccccag ggctggaagg gcagccccgc catcttccag 780 

agcagcatga ccaagatcct ggagcccttc cgcgccaaga accccgacat cgtgatctac 840 

caggcccccc tgtacgtggg cagcgacctg gagatcggcc agcaccgcgc caagatcgag 900 

gagctgcgcg agcacctgct gaagtggggc ttcaccaccc ccgacaagaa gcaccagaag 960 

gagcccccct tcctgcccat cgagctgcac cccgacaagt ggaccgtgca gcccatcctg 1020 

ctgcccgaga aggacagctg gaccgtgaac gacatccaga agctggtggg caagctgaac 1080 

tgggccagcc agatctaccc cggcatcaag gtgcgccagc tgtgcaagct gctgcgcggc 1140 

gccaaggccc tgaccgacat cgtgcccctg accgaggagg ccgagctgga gctggccgag 1200 

aaccgcgaga tcctgcgcga gcccgtgcac ggcgtgtact acgaccccag caaggacctg 1260 

atcgccgaga tccagaagca gggccacgag cagtggacct accagatcta ccaggagccc 1320 

ttcaagaacc tgaagaccgg caagtacgcc aagatgcgca ccacccacac caacgacgtg 1380 
aagcagctga ccgaggccgt gcagaagatc gccatggaga gcatcgtgat ctggggcaag 1440

acccccaagt tccgcctgcc catccagaag gagacctggg agacctggtg gaccgactac 1500 

tggcaggcca cctggatccc cgagtgggag ttcgtgaaca ccccccccct ggtgaagctg 1560 

tggtaccagc tggagaagga ccccatcgcc ggcgtggaga ccttctacgt ggacggcgcc 1620 

accaaccgcg aggccaagat cggcaaggcc ggctacgtga ccgaccgcgg ccgccagaag 1680 

atcgtgaccc tgaccaacac caccaaccag aagaccgagc tgcaggccat ccagctggcc 1740 

ctgcaggaca gcggcagcga ggtgaacatc gtgaccgaca gccagtacgc cctgggcatc 1800 

atccaggccc agcccgacaa gagcgacagc gagatcttca accagatcat cgagcagctg 1860 

atcaacaagg agcgcatcta cctgagctgg gtgcccgccc acaagggcat cggcggcaac 1920 

gagcaggtgg acaagctggt gagcaagggc atccgcaagg tgctg 1965 

<210> 69 

<211> 1965 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Inactivated Protease Mutated Reverse Transcriptase 
Wild Type 

<400> 69 



cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60

gaggctctct tagccacagg agcagatgat acagtattag aagaaataga tttgccaggg 120

aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180

caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240

ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaattttcca 300

attagtccta ttgaaactgt accagtaaaa ttaaaaccag gaatggatgg cccaaaggtc 360

aaacaatggc cattgacaga agaaaaaata aaagcattaa cagcaatttg tgaggaaatg 420

gagaaggaag gaaaaattac aaaaattggg cctgataatc catataacac tccagtattt 480

gccataaaaa agaaggacag tactaagtgg agaaaattag tagatttcag ggaactcaat 540

aaaagaactc aagacttttg ggaagttcaa ttaggaatac cacacccagc aggattaaaa 600

aagaaaaaat cagtgacagt gctagatgtg ggggatgcat atttttcagt tcctttagat 660

gaaagcttca ggaaatatac tgcattcacc atacctagta taaacaatga aacaccaggg 720

attagatatc aatataatgt gctgccacag ggatggaaag gatcaccagc aatattccag 780

agtagcatga caaaaatctt agagcccttc agagcaaaaa atccagacat agttatctat 840

caagccccgt tgtatgtagg atctgactta gaaatagggc aacatagagc aaaaatagaa 900 

gagttaaggg aacatttatt gaaatgggga tttacaacac cagacaagaa acatcaaaaa 960 

gaacccccat ttcttcccat cgaactccat cctgacaaat ggacagtaca acctatactg 1020 

ctgccagaaa aggatagttg gactgtcaat gatatacaga agttagtggg aaaattaaac 1080 

tgggcaagtc agatttaccc agggattaaa gtaaggcaac tctgtaaact cctcaggggg 1140 

gccaaagcac taacagacat agtaccacta actgaagaag cagaattaga attggcagag 1200 

aacagggaaa ttttaagaga accagtacat ggagtatatt atgatccatc aaaagacttg 1260 

atagctgaaa tacagaaaca ggggcatgaa caatggacat atcaaattta tcaagaacca 1320 

tttaaaaatc tgaaaacagg gaagtatgca aaaatgagga ctacccacac taatgatgta 1380 

aaacagttaa cagaggcagt gcaaaaaata gccatggaaa gcatagtaat atggggaaag 1440 

actcctaaat ttagactacc catccaaaaa gaaacatggg agacatggtg gacagactat 1500 

tggcaagcca cctggatccc tgagtgggag tttgttaata cccctcccct agtaaaatta 1560 

tggtaccaac tagaaaaaga tcccatagca ggagtagaaa ctttctatgt agatggagca 1620 

actaataggg aagctaaaat aggaaaagca gggtatgtta ctgacagagg aaggcagaaa 1680 

attgttactc taactaacac aacaaatcag aagactgagt tacaagcaat tcagctagct 1740 

ctgcaggatt caggatcaga agtaaacata gtaacagact cacagtatgc attaggaatc 1800 

attcaagcac aaccagataa gagtgactca gagatattta accaaataat agaacagtta 1860 

ataaacaagg aaagaatcta cctgtcatgg gtaccagcac ataaaggaat tgggggaaat 1920 

gaacaagtag ataaattagt aagtaaggga attaggaaag tgttg 1965 

<210> 70 
<211> 1977 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Protease and Reverse Transcriptase Optimized 
<400> 70 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 

gaggccctgc tggacaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 

aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240

cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttcccc 300 

atcagcccca tcgagaccgt gcccgtgaag ctgaagcccg gcatggacgg ccccaaggtg 360

aagcagtggc ccctgaccga ggagaagatc aaggccctga ccgccatctg cgaggagatg 420 

gagaaggagg gcaagatcac caagatcggc cccgacaacc cctacaacac ccccgtgttc 480 

gccatcaaga agaaggacag caccaagtgg cgcaagctgg tggacttccg cgagctgaac 540 

aagcgcaccc aggacttctg ggaggtgcag ctgggcatcc cccaccccgc cggcctgaag 600 

aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggac 660 

gagagcttcc gcaagtacac cgccttcacc atccccagca tcaacaacga gacccccggc 720 

atccgctacc agtacaacgt gctgccccag ggctggaagg gcagccccgc catcttccag 780 

agcagcatga ccaagatcct ggagcccttc cgcgccaaga accccgacat cgtgatctac 840 

cagtacatgg acgacctgta cgtgggcagc gacctggaga tcggccagca ccgcgccaag 900 

atcgaggagc tgcgcgagca cctgctgaag tggggcttca ccacccccga caagaagcac 960 

cagaaggagc cccccttcct gtggatgggc tacgagctgc accccgacaa gtggaccgtg 1020 

cagcccatcc tgctgcccga gaaggacagc tggaccgtga acgacatcca gaagctggtg 1080 

ggcaagctga actgggccag ccagatctac cccggcatca aggtgcgcca gctgtgcaag 1140 

ctgctgcgcg gcgccaaggc cctgaccgac atcgtgcccc tgaccgagga ggccgagctg 1200

gagctggccg agaaccgcga gatcctgcgc gagcccgtgc acggcgtgta ctacgacccc 1260

agcaaggacc tgatcgccga gatccagaag cagggccacg agcagtggac ctaccagatc 1320 

taccaggagc ccttcaagaa cctgaagacc ggcaagtacg ccaagatgcg caccacccac 1380 

accaacgacg tgaagcagct gaccgaggcc gtgcagaaga tcgccatgga gagcatcgtg 1440 

atctggggca agacccccaa gttccgcctg cccatccaga aggagacctg ggagacctgg 1500 

tggaccgact actggcaggc cacctggatc cccgagtggg agttcgtgaa cacccccccc 1560 

ctggtgaagc tgtggtacca gctggagaag gaccccatcg ccggcgtgga gaccttctac 1620 

gtggacggcg ccaccaaccg cgaggccaag atcggcaagg ccggctacgt gaccgaccgc 1680 

ggccgccaga agatcgtgac cctgaccaac accaccaacc agaagaccga gctgcaggcc 1740 

atccagctgg ccctgcagga cagcggcagc gaggtgaaca tcgtgaccga cagccagtac 1800 

gccctgggca tcatccaggc ccagcccgac aagagcgaca gcgagatctt caaccagatc 1860 

atcgagcagc tgatcaacaa ggagcgcatc tacctgagct gggtgcccgc ccacaagggc 1920 
atcggcggca acgagcaggt ggacaagctg gtgagcaagg gcatccgcaa ggtgctg 1977



<210> 71 
<211> 1977 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Protease and Reverse Transcriptase Wild Type 
<400> 71 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 

gaggctctct tagacacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 

aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 

caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 

ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaattttcca 300 

attagtccta ttgaaactgt accagtaaaa ttaaaaccag gaatggatgg cccaaaggtc 360 

aaacaatggc cattgacaga agaaaaaata aaagcattaa cagcaatttg tgaggaaatg 420 

gagaaggaag gaaaaattac aaaaattggg cctgataatc catataacac tccagtattt 480 

gccataaaaa agaaggacag tactaagtgg agaaaattag tagatttcag ggaactcaat 540 

aaaagaactc aagacttttg ggaagttcaa ttaggaatac cacacccagc aggattaaaa 600 

aagaaaaaat cagtgacagt gctagatgtg ggggatgcat atttttcagt tcctttagat 660 

gaaagcttca ggaaatatac tgcattcacc atacctagta taaacaatga aacaccaggg 720 

attagatatc aatataatgt gctgccacag ggatggaaag gatcaccagc aatattccag 780 

agtagcatga caaaaatctt agagcccttc agagcaaaaa atccagacat agttatctat 840 

caatatatgg atgacttgta tgtaggatct gacttagaaa tagggcaaca tagagcaaaa 900 

atagaagagt taagggaaca tttattgaaa tggggattta caacaccaga caagaaacat 960 

caaaaagaac ccccatttct ttggatgggg tatgaactcc atcctgacaa atggacagta 1020 

caacctatac tgctgccaga aaaggatagt tggactgtca atgatataca gaagttagtg 1080 

ggaaaattaa actgggcaag tcagatttac ccagggatta aagtaaggca actctgtaaa 1140 

ctcctcaggg gggccaaagc actaacagac atagtaccac taactgaaga agcagaatta 1200 

gaattggcag agaacaggga aattttaaga gaaccagtac atggagtata ttatgatcca 1260 

tcaaaagact tgatagctga aatacagaaa caggggcatg aacaatggac atatcaaatt 1320 
tatcaagaac catttaaaaa tctgaaaaca gggaagtatg caaaaatgag gactacccac 1380
actaatgatg taaaacagtt aacagaggca gtgcaaaaaa tagccatgga aagcatagta 1440
atatggggaa agactcctaa atttagacta cccatccaaa aagaaacatg ggagacatgg 1500
tggacagact attggcaagc cacctggatc cctgagtggg agtttgttaa tacccctccc 1560
ctagtaaaat tatggtacca actagaaaaa gatcccatag caggagtaga aactttctat 1620
gtagatggag caactaatag ggaagctaaa ataggaaaag cagggtatgt tactgacaga 1680
ggaaggcaga aaattgttac tctaactaac acaacaaatc agaagactga gttacaagca 1740
attcagctag ctctgcagga ttcaggatca gaagtaaaca tagtaacaga ctcacagtat 1800
gcattaggaa tcattcaagc acaaccagat aagagtgact cagagatatt taaccaaata 1860
atagaacagt taataaacaa ggaaagaatc tacctgtcat gggtaccagc acataaagga 1920
attgggggaa atgaacaagt agataaatta gtaagtaagg gaattaggaa agtgttg 1977



<210> 72 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C RevExon1 Optimized
<400> 72 

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggtggtgaa gatcatcaag 60
atcctgtacc agagc 75



<210> 73 

<211> 76 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C RevExon1 Wild Type
<400> 73 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagtggtgaa gatcatcaaa 60
atcctctatc aaagca 76



<210> 74 

<211> 246 

<212> DNA 

<213> Artificial 

<220> 
<223> HIV Type C RevExon2 Optimized
<400> 74 

ccctacccca agcccgaggg cacccgccag gcccgccgca accgccgccg ccgctggcgc 60 

gcccgccagc gccagatcca caccatcggc gagcgcatcc tggtggcctg cctgggccgc 120

agcgccgagc ccgtgcccct gcagctgccc cccctggagc gcctgcacat caactgcagc 180 

gagggcagcg gcaccagcgg cacccagcag agccagggca ccaccgaggg cgtgggcgac 240 

ccctaa 246 

<210> 75 

<211> 248 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C RevExon2 Wild Type 
<400> 75 

acccttaccc caagcccgag gggactcgac aggctcggag gaatcgaaga agaaggtgga 60 
gagcaagaca gagacagatc catacgattg gtgagcggat tcttgtcgct tgcctgggac 120 
gatctgcgga gcctgtgcct cttcagctac caccgcttga gagacttcat attaattgca 180 
gtgagggcag tggaacttct gggacacagc agtctcaggg gactacagag ggggtgggag 240 
atccttaa 248 

<210> 76 

<211> 1680 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Reverse Transcriptase Optimized 
<400> 76 

cccatcagcc ccatcgagac cgtgcccgtg aagctgaagc ccggcatgga cggccccaag 60 

gtgaagcagt ggcccctgac cgaggagaag atcaaggccc tgaccgccat ctgcgaggag 120 

atggagaagg agggcaagat caccaagatc ggccccgaca acccctacaa cacccccgtg 180 

ttcgccatca agaagaagga cagcaccaag tggcgcaagc tggtggactt ccgcgagctg 240 

aacaagcgca cccaggactt ctgggaggtg cagctgggca tcccccaccc cgccggcctg 300 

aagaagaaga agagcgtgac cgtgctggac gtgggcgacg cctacttcag cgtgcccctg 360 

gacgagagct tccgcaagta caccgccttc accatcccca gcatcaacaa cgagaccccc 420

ggcatccgct accagtacaa cgtgctgccc cagggctgga agggcagccc cgccatcttc 480

cagagcagca tgaccaagat cctggagccc ttccgcgcca agaaccccga catcgtgatc 540 

taccagtaca tggacgacct gtacgtgggc agcgacctgg agatcggcca gcaccgcgcc 600 

aagatcgagg agctgcgcga gcacctgctg aagtggggct tcaccacccc cgacaagaag 660 

caccagaagg agcccccctt cctgtggatg ggctacgagc tgcaccccga caagtggacc 720 

gtgcagccca tcctgctgcc cgagaaggac agctggaccg tgaacgacat ccagaagctg 780 

gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 840 

aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 900 

ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 960 

cccagcaagg acctgatcgc cgagatccag aagcagggcc acgagcagtg gacctaccag 1020 

atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccacc 1080 

cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1140 

gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1200 

tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 1260 

cccctggtga agctgtggta ccagctggag aaggacccca tcgccggcgt ggagaccttc 1320 

tacgtggacg gcgccaccaa ccgcgaggcc aagatcggca aggccggcta cgtgaccgac 1380 

cgcggccgcc agaagatcgt gaccctgacc aacaccacca accagaagac cgagctgcag 1440 

gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 1500 

tacgccctgg gcatcatcca ggcccagccc gacaagagcg acagcgagat cttcaaccag 1560 

atcatcgagc agctgatcaa caaggagcgc atctacctga gctgggtgcc cgcccacaag 1620 

ggcatcggcg gcaacgagca ggtggacaag ctggtgagca agggcatccg caaggtgctg 1680 

<210> 77 
<211> 1680 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Reverse Transcriptase Wild Type 
<400> 77 

ccaattagtc ctattgaaac tgtaccagta aaattaaaac caggaatgga tggcccaaag 60 

gtcaaacaat ggccattgac agaagaaaaa ataaaagcat taacagcaat ttgtgaggaa 120

atggagaagg aaggaaaaat tacaaaaatt gggcctgata atccatataa cactccagta 180

tttgccataa aaaagaagga cagtactaag tggagaaaat tagtagattt cagggaactc 240

aataaaagaa ctcaagactt ttgggaagtt caattaggaa taccacaccc agcaggatta 300

aaaaagaaaa aatcagtgac agtgctagat gtgggggatg catatttttc agttccttta 360

gatgaaagct tcaggaaata tactgcattc accataccta gtataaacaa tgaaacacca 420 

gggattagat atcaatataa tgtgctgcca cagggatgga aaggatcacc agcaatattc 480 

cagagtagca tgacaaaaat cttagagccc ttcagagcaa aaaatccaga catagttatc 540 

tatcaatata tggatgactt gtatgtagga tctgacttag aaatagggca acatagagca 600 

aaaatagaag agttaaggga acatttattg aaatggggat ttacaacacc agacaagaaa 660 

catcaaaaag aacccccatt tctttggatg gggtatgaac tccatcctga caaatggaca 720 

gtacaaccta tactgctgcc agaaaaggat agttggactg tcaatgatat acagaagtta 780 

gtgggaaaat taaactgggc aagtcagatt tacccaggga ttaaagtaag gcaactctgt 840 

aaactcctca ggggggccaa agcactaaca gacatagtac cactaactga agaagcagaa 900 

ttagaattgg cagagaacag ggaaatttta agagaaccag tacatggagt atattatgat 960 

ccatcaaaag acttgatagc tgaaatacag aaacaggggc atgaacaatg gacatatcaa 1020 

atttatcaag aaccatttaa aaatctgaaa acagggaagt atgcaaaaat gaggactacc 1080 

cacactaatg atgtaaaaca gttaacagag gcagtgcaaa aaatagccat ggaaagcata 1140 

gtaatatggg gaaagactcc taaatttaga ctacccatcc aaaaagaaac atgggagaca 1200

tggtggacag actattggca agccacctgg atccctgagt gggagtttgt taatacccct 1260

cccctagtaa aattatggta ccaactagaa aaagatccca tagcaggagt agaaactttc 1320 

tatgtagatg gagcaactaa tagggaagct aaaataggaa aagcagggta tgttactgac 1380 

agaggaaggc agaaaattgt tactctaact aacacaacaa atcagaagac tgagttacaa 1440 

gcaattcagc tagctctgca ggattcagga tcagaagtaa acatagtaac agactcacag 1500 

tatgcattag gaatcattca agcacaacca gataagagtg actcagagat atttaaccaa 1560 

ataatagaac agttaataaa caaggaaaga atctacctgt catgggtacc agcacataaa 1620 

ggaattgggg gaaatgaaca agtagataaa ttagtaagta agggaattag gaaagtgttg 1680 

<210> 78 
<211> 1668 
<212> DNA 
<213> Artificial

<220>

<223> HIV Type C Mutated Reverse Transcriptase Optimized 
<400> 78 

cccatcagcc ccatcgagac cgtgcccgtg aagctgaagc ccggcatgga cggccccaag 60 

gtgaagcagt ggcccctgac cgaggagaag atcaaggccc tgaccgccat ctgcgaggag 120 

atggagaagg agggcaagat caccaagatc ggccccgaca acccctacaa cacccccgtg 180 

ttcgccatca agaagaagga cagcaccaag tggcgcaagc tggtggactt ccgcgagctg 240 

aacaagcgca cccaggactt ctgggaggtg cagctgggca tcccccaccc cgccggcctg 300 

aagaagaaga agagcgtgac cgtgctggac gtgggcgacg cctacttcag cgtgcccctg 360 

gacgagagct tccgcaagta caccgccttc accatcccca gcatcaacaa cgagaccccc 420 

ggcatccgct accagtacaa cgtgctgccc cagggctgga agggcagccc cgccatcttc 480 

cagagcagca tgaccaagat cctggagccc ttccgcgcca agaaccccga catcgtgatc 540 

taccaggccc ccctgtacgt gggcagcgac ctggagatcg gccagcaccg cgccaagatc 600 

gaggagctgc gcgagcacct gctgaagtgg ggcttcacca cccccgacaa gaagcaccag 660 

aaggagcccc ccttcctgcc catcgagctg caccccgaca agtggaccgt gcagcccatc 720 

ctgctgcccg agaaggacag ctggaccgtg aacgacatcc agaagctggt gggcaagctg 780 

aactgggcca gccagatcta ccccggcatc aaggtgcgcc agctgtgcaa gctgctgcgc 840 

ggcgccaagg ccctgaccga catcgtgccc ctgaccgagg aggccgagct ggagctggcc 900 

gagaaccgcg agatcctgcg cgagcccgtg cacggcgtgt actacgaccc cagcaaggac 960 

ctgatcgccg agatccagaa gcagggccac gagcagtgga cctaccagat ctaccaggag 1020 

cccttcaaga acctgaagac cggcaagtac gccaagatgc gcaccaccca caccaacgac 1080 

gtgaagcagc tgaccgaggc cgtgcagaag atcgccatgg agagcatcgt gatctggggc 1140 

aagaccccca agttccgcct gcccatccag aaggagacct gggagacctg gtggaccgac 1200 

tactggcagg ccacctggat ccccgagtgg gagttcgtga acaccccccc cctggtgaag 1260 

ctgtggtacc agctggagaa ggaccccatc gccggcgtgg agaccttcta cgtggacggc 1320 

gccaccaacc gcgaggccaa gatcggcaag gccggctacg tgaccgaccg cggccgccag 1380 

aagatcgtga ccctgaccaa caccaccaac cagaagaccg agctgcaggc catccagctg 1440 

gccctgcagg acagcggcag cgaggtgaac atcgtgaccg acagccagta cgccctgggc 1500 

atcatccagg cccagcccga caagagcgac agcgagatct tcaaccagat catcgagcag 1560 

ctgatcaaca aggagcgcat ctacctgagc tgggtgcccg cccacaaggg catcggcggc 1620 

aacgagcagg tggacaagct ggtgagcaag ggcatccgca aggtgctg 1668



<210> 79 
<211> 1668 
<212> DNA 
<213> Artificial 

<220> 

<223> HIV Type C Mutated Reverse Transcriptase Wild Type 
<400> 79 

ccaattagtc ctattgaaac tgtaccagta aaattaaaac caggaatgga tggcccaaag 60 

gtcaaacaat ggccattgac agaagaaaaa ataaaagcat taacagcaat ttgtgaggaa 120 

atggagaagg aaggaaaaat tacaaaaatt gggcctgata atccatataa cactccagta 180 

tttgccataa aaaagaagga cagtactaag tggagaaaat tagtagattt cagggaactc 240 

aataaaagaa ctcaagactt ttgggaagtt caattaggaa taccacaccc agcaggatta 300 

aaaaagaaaa aatcagtgac agtgctagat gtgggggatg catatttttc agttccttta 360 

gatgaaagct tcaggaaata tactgcattc accataccta gtataaacaa tgaaacacca 420

gggattagat atcaatataa tgtgctgcca cagggatgga aaggatcacc agcaatattc 480 

cagagtagca tgacaaaaat cttagagccc ttcagagcaa aaaatccaga catagttatc 540 

tatcaagccc cgttgtatgt aggatctgac ttagaaatag ggcaacatag agcaaaaata 600 

gaagagttaa gggaacattt attgaaatgg ggatttacaa caccagacaa gaaacatcaa 660 

aaagaacccc catttcttcc catcgaactc catcctgaca aatggacagt acaacctata 720 

ctgctgccag aaaaggatag ttggactgtc aatgatatac agaagttagt gggaaaatta 780 

aactgggcaa gtcagattta cccagggatt aaagtaaggc aactctgtaa actcctcagg 840 

ggggccaaag cactaacaga catagtacca ctaactgaag aagcagaatt agaattggca 900 

gagaacaggg aaattttaag agaaccagta catggagtat attatgatcc atcaaaagac 960 

ttgatagctg aaatacagaa acaggggcat gaacaatgga catatcaaat ttatcaagaa 1020 

ccatttaaaa atctgaaaac agggaagtat gcaaaaatga ggactaccca cactaatgat 1080 

gtaaaacagt taacagaggc agtgcaaaaa atagccatgg aaagcatagt aatatgggga 1140 

aagactccta aatttagact acccatccaa aaagaaacat gggagacatg gtggacagac 1200 

tattggcaag ccacctggat ccctgagtgg gagtttgtta atacccctcc cctagtaaaa 1260 

ttatggtacc aactagaaaa agatcccata gcaggagtag aaactttcta tgtagatgga 1320

gcaactaata gggaagctaa aataggaaaa gcagggtatg ttactgacag aggaaggcag 1380

aaaattgtta ctctaactaa cacaacaaat cagaagactg agttacaagc aattcagcta 1440 

gctctgcagg attcaggatc agaagtaaac atagtaacag actcacagta tgcattagga 1500 

atcattcaag cacaaccaga taagagtgac tcagagatat ttaaccaaat aatagaacag 1560 

ttaataaaca aggaaagaat ctacctgtca tgggtaccag cacataaagg aattggggga 1620 

aatgaacaag tagataaatt agtaagtaag ggaattagga aagtgttg 1668 

<210> 80 

<211> 216 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C TatC22Exon1 Optimized
<400> 80 

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60 
gccggcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120 
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180 
agcggcgagg accaccagaa ccccctgagc aagcag 216 

<210> 81 

<211> 216 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C TatExon1 Optimized
<400> 81

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60 
gcctgcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120 
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180 
agcggcgagg accaccagaa ccccctgagc aagcag 216 



<210> 82 

<211> 216 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C TatExon1 Wild Type
<400> 82

atggagccag tagatcctaa actaaagccc tggaaccatc caggaagcca acctaaaaca 60
gcttgtaata attgcttttg caaacactgt agctatcatt gtctagtttg ctttcagaca 120
aaaggtttag gcatttccta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180
agtggtgaag atcatcaaaa tcctctatca aagcag 216

<210> 83 

<211> 93 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C TatExon2 Optimized 
<400> 83 

cccctgcccc aggcccgcgg cgacagcacc ggcagcgagg agagcaagaa gaaggtggag 60
agcaagaccg agaccgaccc ctacgactgg tga 93



<210> 84 

<211> 93 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C TatExon2 Wild Type 
<400> 84 

cccttacccc aagcccgagg ggactcgaca ggctcggagg aatcgaagaa gaaggtggag 60
agcaagacag agacagatcc atacgattgg tga 93



<210> 85 

<211> 579 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Vif Optimized 
<400> 85 

atggagaacc gctggcaggt gctgatcgtg tggcaggtgg accgcatgaa gatccgcgcc 60
tggaacagcc tggtgaagca ccacatgtac atcagccgcc gcgccagcgg ctgggtgtac 120
cgccaccact tcgagagccg ccaccccaag gtgagcagcg aggtgcacat ccccctgggc 180
gacgcccgcc tggtgatcaa gacctactgg ggcctgcaga ccggcgagcg cgactggcac 240
ctgggccacg gcgtgagcat cgagtggcgc ctgcgcgagt acagcaccca ggtggacccc 300
gacctggccg accagctgat ccacatgcac tacttcgact gcttcaccga gagcgccatc 360

cgccaggcca tcctgggcca catcgtgttc ccccgctgcg actaccaggc cggccacaag 420 

aaggtgggca gcctgcagta cctggccctg accgccctga tcaagcccaa gaagcgcaag 480 

ccccccctgc ccagcgtgcg caagctggtg gaggaccgct ggaacgaccc ccagaagacc 540 

cgcggccgcc gcggcaacca caccatgaac ggccactag 579 

<210> 86 

<211> 579 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Vif Wild Type 
<400> 86 

atggaaaaca gatggcaggt gctgattgtg tggcaggtgg acaggatgaa gattagagca 60 

tggaatagtt tagtaaagca ccatatgtat atatcaagga gagctagtgg atgggtctac 120 

agacatcatt ttgaaagcag acatccaaaa gtaagttcag aagtacatat cccattaggg 180 

gatgctagat tagtaataaa aacatattgg ggtttgcaga caggagaaag agattggcat 240 

ttgggtcatg gagtctccat agaatggaga ctgagagaat acagcacaca agtagaccct 300 

gacctggcag accagctaat tcacatgcat tattttgatt gttttacaga atctgccata 360 

agacaagcca tattaggaca catagttttt cctaggtgtg actatcaagc aggacataag 420 

aaggtaggat ctctgcaata cttggcactg acagcattga taaaaccaaa aaagagaaag 480 

ccacctctgc ctagtgttag aaaattagta gaggatagat ggaacgaccc ccagaagacc 540 

aggggccgca gagggaacca tacaatgaat ggacactag 579 

<210> 87 

<211> 288 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Vpr Optimized 
<400> 87 

atggagcgcc cccccgagga ccagggcccc cagcgcgagc cctacaacga gtggaccctg 60 
gagatcctgg aggagctgaa gcaggaggcc gtgcgccact tcccccgccc ctggctgcac 120 
agcctgggcc agtacatcta cgagacctac ggcgacacct ggaccggcgt ggaggccatc 180 
atccgcgtgc tgcagcagct gctgttcatc cacttccgca tcggctgcca gcacagccgc 240
atcggcatcc tgcgccagcg ccgcgcccgc aacggcgcca gccgcagc 288



<210> 88 

<211> 288 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Vpr Wild Type 
<400> 88 

atggaacgac ccccagaaga ccaggggccg cagagggaac catacaatga atggacacta 60
gagattctag aagaactcaa gcaggaagct gtcagacact ttcctagacc atggctccat 120
agcttaggac aatatatcta tgaaacctat ggggatactt ggacgggagt tgaagctata 180
ataagagtac tgcaacaact actgttcatt catttcagaa ttggatgcca acatagcaga 240
ataggcatct tgcgacagag aagagcaaga aatggagcca gtagatcc 288

<210> 89

<211> 267

<212> DNA

<213> Artificial

<220>

<223> HIV Type C Vpu Optimized
<400> 89

atggtgagcc tgagcctgtt caagggcgtg gactaccgcc tgggcgtggg cgccctgatc 60
gtggccctga tcatcgccat catcgtgtgg accatcgcct acatcgagta ccgcaagctg 120
gtgcgccaga agaagatcga ctggctgatc aagcgcatcc gcgagcgcgc cgaggacagc 180
ggcaacgaga gcgacggcga caccgaggag ctgagcacca tggtggacat gggccacctg 240
cgcctgctgg acgccaacga cctgtaa 267

<210> 90

<211> 267

<212> DNA

<213> Artificial

<220>

<223> HIV Type C Vpu Wild Type
<400> 90

atggtaagtt taagtttatt taaaggagta gattatagat taggagtagg agcattgata 60
gtagcactaa tcatagcaat aatagtgtgg accatagcat atatagaata taggaaattg 120
gtaagacaaa agaaaataga ctggttaatt aaaagaatta gggaaagagc agaagacagt 180
ggcaatgaga gtgatgggga cacagaagaa ttgtcaacaa tggtggatat ggggcatctt 240
aggcttctgg atgctaatga tttgtaa 267

<210> 91 

<211> 321 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C RevExon 1 and 2 Optimized 
<400> 91 

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggtggtgaa gatcatcaag 60 

atcctgtacc agagccccta ccccaagccc gagggcaccc gccaggcccg ccgcaaccgc 120 

cgccgccgct ggcgcgcccg ccagcgccag atccacacca tcggcgagcg catcctggtg 180 

gcctgcctgg gccgcagcgc cgagcccgtg cccctgcagc tgccccccct ggagcgcctg 240 

cacatcaact gcagcgaggg cagcggcacc agcggcaccc agcagagcca gggcaccacc 300 

gagggcgtgg gcgaccccta a 321 

<210> 92 

<211> 324 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C RevExon 1 and 2 Wild Type 
<400> 92 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagtggtgaa gatcatcaaa 60 

atcctctatc aaagcaaccc ttaccccaag cccgagggga ctcgacaggc tcggaggaat 120 

cgaagaagaa ggtggagagc aagacagaga cagatccata cgattggtga gcggattctt 180 

gtcgcttgcc tgggacgatc tgcggagcct gtgcctcttc agctaccacc gcttgagaga 240 

cttcatatta attgcagtga gggcagtgga acttctggga cacagcagtc tcaggggact 300 

acagaggggg tgggagatcc ttaa 324 



<210> 93 

<211> 309 

<212> DNA 

<213> Artificial 

<220>

<223> HIV Type C TatC22 Exon 1 and 2 Optimized
<400> 93 

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60 

gccggcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120 

aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180 

agcggcgagg accaccagaa ccccctgagc aagcagcccc tgccccaggc ccgcggcgac 240

agcaccggca gcgaggagag caagaagaag gtggagagca agaccgagac cgacccctac 300 

gactggtga 309 

<210> 94 

<211> 309 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Tat Exon 1 and 2 Optimized 
<400> 94 

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60 

gcctgcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120 

aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180 

agcggcgagg accaccagaa ccccctgagc aagcagcccc tgccccaggc ccgcggcgac 240 

agcaccggca gcgaggagag caagaagaag gtggagagca agaccgagac cgacccctac 300 

gactggtga 309 

<210> 95 

<211> 309 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C Tat Exon 1 and 2 Wild Type 
<400> 95 

atggagccag tagatcctaa actaaagccc tggaaccatc caggaagcca acctaaaaca 60 
gcttgtaata attgcttttg caaacactgt agctatcatt gtctagtttg ctttcagaca 120 
aaaggtttag gcatttccta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180 
agtggtgaag atcatcaaaa tcctctatca aagcagccct taccccaagc ccgaggggac 240 
tcgacaggct cggaggaatc gaagaagaag gtggagagca agacagagac agatccatac 300
gattggtga 309



<210> 96 

<211> 624 

<212> DNA 

<213> Artificial 

<220> 

<223> HIV Type C NefD125g Optimized Myristoylation Modification
<400> 96 

atggccggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60 

cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120 

ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180 

caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg gctggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480 

gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540 

gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 

<210> 97 

<211> 2565 

<212> DNA 

<213> Artificial 

<220> 

<223> Envgp160_TV2_C_ZAopt
<400> 97 

atgcgcgccc gcggcatcct gaagaactac cgccactggt ggatctgggg catcctgggc 60 

ttctggatgc tgatgatgtg caacgtgaag ggcctgtggg tgaccgtgta ctacggcgtg 120

cccgtgggcc gcgaggccaa gaccaccctg ttctgcgcca gcgacgccaa ggcctacgag 180 

aaggaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 240 

gaggtgatcc tgggcaacgt gaccgagaac ttcaacatgt ggaagaacga catggtggac 300 

cagatgcagg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 360 

acccccctgt gcgtgaccct gaactgcacc aacgccaccg tgaactacaa caacaccagc 420

aaggacatga agaactgcag cttctacgtg accaccgagc tgcgcgacaa gaagaagaag 480

gagaacgccc tgttctaccg cctggacatc gtgcccctga acaaccgcaa gaacggcaac 540 

atcaacaact accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 600 

gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc ccccctgaag 660 

tgcaacaaca agaagttcaa cggcatcggc ccctgcgaca acgtgagcac cgtgcagtgc 720 

acccacggca tcaagcccgt ggtgagcacc cagctgctgc tgaacggcag cctggccgag 780 

gaggagatca tcatccgcag cgagaacctg accaacaacg tgaagaccat catcgtgcac 840 

ctgaacgaga gcatcgagat caagtgcacc cgccccggca acaacacccg caagagcgtg 900 

cgcatcggcc ccggccaggc cttctacgcc accggcgaca tcatcggcga catccgccag 960 

gcccactgca acatcagcaa gaacgagtgg aacaccaccc tgcagcgcgt gagccagaag 1020 

ctgcaggagc tgttccccaa cagcaccggc atcaagttcg ccccccacag cggcggcgac 1080 

ctggagatca ccacccacag cttcaactgc ggcggcgagt tcttctactg caacaccacc 1140 

gacctgttca acagcaccta cagcaacggc acctgcacca acggcacctg catgagcaac 1200 

aacaccgagc gcatcaccct gcagtgccgc atcaagcaga tcatcaacat gtggcaggag 1260 

gtgggccgcg ccatgtacgc cccccccatc gccggcaaca tcacctgccg cagcaacatc 1320 

accggcctgc tgctgacccg cgacggcggc gacaacaaca ccgagaccga gaccttccgc 1380 

cccggcggcg gcgacatgcg cgacaactgg cgcagcgagc tgtacaagta caaggtggtg 1440 

gagatcaagc ccctgggcgt ggcccccacc gccgccaagc gccgcgtggt ggagcgcgag 1500 

aagcgcgccg tgggcatcgg cgccgtgttc ctgggcttcc tgggcgccgc cggcagcacc 1560 

atgggcgccg ccagcatcac cctgaccgtg caggcccgcc agctgctgag cggcatcgtg 1620 

cagcagcaga gcaacctgct gcgcgccatc gaggcccagc agcacatgct gcagctgacc 1680 

gtgtggggca tcaagcagct gcaggcccgc gtgctggcca tcgagcgcta cctgcaggac 1740 

cagcagctgc tgggcctgtg gggctgcagc ggcaagctga tctgcaccac caacgtgctg 1800 

tggaacagca gctggagcaa caagacccag agcgacatct gggacaacat gacctggatg 1860 

cagtgggacc gcgagatcag caactacacc aacaccatct accgcctgct ggaggacagc 1920 

cagagccagc aggagcgcaa cgagaaggac ctgctggccc tggaccgctg gaacaacctg 1980 

tggaactggt tcagcatcac caactggctg tggtacatca agatcttcat catgatcgtg 2040 

ggcggcctga tcggcctgcg catcatcttc gccgtgctga gcctggtgaa ccgcgtgcgc 2100

cagggctaca gccccctgag cctgcagacc ctgatcccca acccccgcgg ccccgaccgc 2160

ctgggcggca tcgaggagga gggcggcgag caggacagca gccgcagcat ccgcctggtg 2220 

agcggcttcc tgaccctggc ctgggacgac ctgcgcagcc tgtgcctgtt ctgctaccac 2280 

cgcctgcgcg acttcatcct gatcgtggtg cgcgccgtgg agctgctggg ccacagcagc 2340 

ctgcgcggcc tgcagcgcgg ctggggcacc ctgaagtacc tgggcagcct ggtgcagtac 2400 

tggggcctgg agctgaagaa gagcgccatc aacctgctgg acaccatcgc catcgccgtg 2460 

gccgagggca ccgaccgcat cctggagttc atccagaacc tgtgccgcgg catccgcaac 2520 

gtgccccgcc gcatccgcca gggcttcgag gccgccctgc agtaa 2565 

<210> 98 
<211> 2565 
<212> DNA 
<213> Artificial 

<220> 

<223> Envgp160_TV2_C_ZAwt
<400> 98 

atgagagcga gggggatact gaagaattat cgacactggt ggatatgggg catcttaggc 60 

ttttggatgc taatgatgtg taatgtgaag ggcttgtggg tcacagtcta ctacggggta 120 

cctgtgggga gagaagcaaa aactactcta ttttgtgcat cagatgctaa agcatatgag 180 

aaagaagtgc ataatgtctg ggctacacat gcctgtgtac ccacagaccc caacccacaa 240 

gaagtgattt tgggcaatgt aacagaaaat tttaacatgt ggaaaaatga catggtggat 300 

cagatgcagg aagatataat cagtttatgg gatcaaagcc ttaagccatg tgtaaaattg 360 

accccactct gtgtcacttt aaactgtaca aatgcaactg ttaactacaa taatacctct 420 

aaagacatga aaaattgctc tttctatgta accacagaat taagagataa gaaaaagaaa 480 

gaaaatgcac ttttttatag acttgatata gtaccactta ataataggaa gaatgggaat 540 

attaacaact atagattaat aaattgtaat acctcagcca taacacaagc ctgtccaaaa 600 

gtctcgtttg acccaattcc tatacattat tgtgctccag ctggttatgc gcctctaaaa 660 

tgtaataata agaaattcaa tggaatagga ccatgcgata atgtcagcac agtacaatgt 720 

acacatggaa ttaagccagt ggtatcaact caattactgt taaatggtag cctagcagaa 780 

gaagagataa taattagatc tgaaaatctg acaaacaatg tcaaaacaat aatagtacat 840 

cttaatgaat ctatagagat taaatgtaca agacctggca ataatacaag aaagagtgtg 900 

agaataggac caggacaagc attctatgca acaggagaca taataggaga tataagacaa 960

gcacattgta acattagtaa aaatgaatgg aatacaactt tacaaagggt aagtcaaaaa 1020

ttacaagaac tcttccctaa tagtacaggg ataaaatttg caccacactc aggaggggac 1080 

ctagaaatta ctacacatag ctttaattgt ggaggagaat ttttctattg caatacaaca 1140 

gacctgttta atagtacata cagtaatggt acatgcacta atggtacatg catgtctaat 1200 

aatacagagc gcatcacact ccaatgcaga ataaaacaaa ttataaacat gtggcaggag 1260 

gtaggacgag caatgtatgc ccctcccatt gcaggaaaca taacatgtag atcaaatatt 1320 

acaggactac tattaacacg tgatggagga gataataata ctgaaacaga gacattcaga 1380 

cctggaggag gagacatgag ggacaattgg agaagtgaat tatataaata caaggtggta 1440 

gaaattaaac cattaggagt agcacccact gctgcaaaaa ggagagtggt ggagagagaa 1500 

aaaagagcag taggaatagg agctgtgttc cttgggttct tgggagcagc aggaagcact 1560 

atgggcgcag catcaataac gctgacggta caggccagac aattattgtc tggtatagtg 1620 

caacagcaaa gtaatttgct gagggctata gaggcgcaac agcatatgtt gcaactcacg 1680 

gtctggggca ttaagcagct ccaggcaaga gtcctggcta tagagagata cctacaggat 1740 

caacagctcc taggactgtg gggctgctct ggaaaactca tctgcaccac taatgtgctt 1800 

tggaactcta gttggagtaa taaaactcaa agtgatattt gggataacat gacctggatg 1860 

cagtgggata gggaaattag taattacaca aacacaatat acaggttgct tgaagactcg 1920 

caaagccagc aggaaagaaa tgaaaaagat ttactagcat tggacaggtg gaacaatctg 1980 

tggaattggt ttagcataac aaattggctg tggtatataa aaatattcat aatgatagta 2040 

ggaggcttga taggtttaag aataattttt gctgtgctct ctctagtaaa tagagttagg 2100 

cagggatact cacccttgtc attgcagacc cttatcccaa acccgagggg acccgacagg 2160 

ctcggaggaa tcgaagaaga aggtggagag caagacagca gcagatccat tcgattagtg 2220 

agcggattct tgacacttgc ctgggacgac ctacgaagcc tgtgcctctt ctgctaccac 2280 

cgattgagag acttcatatt aattgtagtg agagcagtgg aacttctggg acacagtagt 2340 

ctcaggggac tgcagagggg gtggggaacc cttaagtatt tggggagtct tgtgcaatat 2400 

tggggtctag agttaaaaaa gagtgctatt aatctgcttg atactatagc aatagcagta 2460 

gctgaaggaa cagataggat tctagaattc atacaaaacc tttgtagagg tatccgcaac 2520 

gtacctagaa gaataagaca gggcttcgaa gcagctttgc aataa 2565 

<210> 99

<211> 1491

<212> DNA 

<213> Artificial 

<220> 

<223> Gag_TV2_C_ZAopt
<400> 99 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acaagtggga gaagatccgc 60 

ctgcgccccg gcggccgcaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 

ctggagcgct tcgccgtgaa ccccggcctg ctggagacca gcgacggctg ccgccagatc 180 

atcaagcagc tgcagcccgc cctgcagacc ggcaccgagg agatccgcag cctgttcaac 240 

accgtggcca ccctgtactg cgtgcacaag ggcatcgacg tgcgcgacac caaggaggcc 300 

ctggacaaga tcgaggagga gcagaacaag tgccagcaga agacccagca ggccgaggcc 360 

gccgacaaga aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 

caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 

ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 

ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 

accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcacgc cggccccgtg 660 

gcccccggcc agatgcgcga gccccgcggc agcgacatcg ccggcaccac cagcaccctg 720 

caggagcaga tcgcctggat gaccagcaac ccccccatcc ccgtgggcga catctacaag 780 

cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 

ctgcgcgccg agcagagcac ccaggaggtg aagaactgga tgaccgacac cctgctggtg 960 

cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccagcctg 1020 

gaggagatga tgaccgcctg ccagggcgtg ggcggcccca gccacaaggc ccgcgtgctg 1080 

gccgaggcca tgagccaggc caacaacacc agcgtgatga tccagaagag caacttcaag 1140 

ggcccccgcc gcgccgtgaa gtgcttcaac tgcggccgcg agggccacat cgcccgcaac 1200 

tgccgcgccc cccgcaagcg cggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 

gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 

cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccctgga gcccaccgcc 1380 

ccccccgccg agagcttcaa gttcaaggag acccccaagc aggagcccaa ggaccgcgag 1440 

cccctgacca gcctgaagag cctgttcggc agcgaccccc tgagccagta a 1491

<210> 100
<211> 1491 
<212> DNA 
<213> Artificial 

<220> 

<223> Gag_TV2_C_ZAwt
<400> 100 

atgggtgcga gagcgtcaat attaagaggg ggaaaattag acaaatggga aaaaattagg 60 

ttacggccag gggggagaaa acactatatg ctaaaacacc tagtatgggc aagcagagag 120 

ctggaaagat ttgcagttaa ccctggcctt ttagagacat cagacggatg tagacaaata 180 

ataaaacagc tacaaccagc tcttcagaca ggaacagagg aaattagatc attatttaac 240 

acagtagcaa ctctctattg tgtacataaa gggatagatg tacgagacac caaggaagcc 300 

ttagacaaga tagaggagga acaaaacaaa tgtcagcaaa aaacacagca ggcggaagcg 360 

gctgacaaaa aggtcagtca aaattatcct atagtgcaga acctccaagg gcaaatggta 420 

caccaggcca tatcacctag aaccttgaat gcatgggtaa aagtaataga ggagaaggct 480 

tttagcccag aggtaatacc catgtttaca gcattatcag aaggagccac cccacaagat 540 

ttaaacacca tgttaaatac agtgggggga catcaagcag ccatgcaaat gttaaaagat 600 

accatcaatg aggaggctgc agaatgggat aggttacatc cagtacatgc agggcctgtt 660 

gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 720 

caagaacaaa tagcatggat gacaagtaac ccacctatcc cagtagggga catctataaa 780 

aggtggataa ttctggggtt aaataaaata gtaagaatgt acagccctgt cagcatttta 840 

gacataaaac aaggaccaaa ggaacccttt agagactatg tagaccggtt cttcaaaact 900 

ttaagagctg aacaatctac acaagaggta aaaaattgga tgacagacac cttgttagtc 960 

caaaatgcga acccagattg taagaccatt ttaagagcat taggaccagg ggcttcatta 1020 

gaagaaatga tgacagcatg tcagggagtg ggaggaccta gccacaaagc aagagttttg 1080 

gctgaggcaa tgagccaagc aaacaataca agtgtaatga tacagaaaag caattttaaa 1140 

ggccctagaa gagctgttaa atgtttcaac tgtggcaggg aagggcacat agccaggaat 1200 

tgcagggccc ctaggaaaag gggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 1260 

gactgtactg agaggcaggc taatttttta gggaaaattt ggccttccca caaggggagg 1320 

ccagggaatt tccttcagag cagaccagag ccaacagccc caccactaga accaacagcc 1380

ccaccagcag agagcttcaa gttcaaggag actccgaagc aggagccgaa agacagggaa 1440

cctttaactt ccctcaaatc actctttggc agcgacccct tgtctcaata a 1491



<210> 101 

<211> 624 

<212> DNA 

<213> Artificial 

<220> 

<223> Nef_TV2_C_ZAopt
<400> 101 

atgggcggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 

cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120 

ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 

caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360

ttcttccccg actggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 

gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540 

gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 

<210> 102 

<211> 624 

<212> DNA 

<213> Artificial 

<220> 

<223> Nef_TV2_C_ZA_wt
<400> 102 

atggggggca agtggtcaaa aagcagtata attggatggc ctgaagtaag agaaagaatc 60 

agacgaacta ggtcagcagc agagggagta ggatcagcgt ctcaagactt agagaaacat 120 

ggggcactta caaccagcaa cacagcccac aacaatgctg cttgcgcctg gctggaagcg 180 

caagaggagg aaggagaagt aggctttcca gtcagacctc aggtaccttt aagaccaatg 240 

acttataaag cagcaataga tctcagcttc tttttaaaag aaaagggggg actggaaggg 300 

ttaatttact ccaagaaaag gcaagagatc cttgatttgt gggtttataa cacacaaggc 360

ttcttccctg attggcaaaa ctacacaccg ggaccagggg tcagatttcc actgaccttt 420

ggatggtact tcaagctaga gccagtcgat ccaagggaag tagaagaggc caatgaagga 480 

gaaaacaact gtttactaca ccctatgagc cagcatggaa tggaggatga agacagagaa 540 

gtattaagat ggaagtttga cagtacgcta gcacgcagac acatggcccg cgagctacat 600 

ccggagtatt acaaagactg ctga 624 

<210> 103 
<211> 3009 
<212> DNA 
<213> Artificial 

<220> 

<223> Pol_TV2_C_ZAopt
<400> 103 

ttcttccgcg agaacctggc cttcccccag ggcgaggccc gcgagttccc cagcgagcag 60 

acccgcgcca acagccccac cacccgcacc aacagcccca ccagccgcga gctgcaggtg 120 

cagggcgaca gcgaggccgg cgccgagcgc cagggcacct tcaacttccc ccagatcacc 180 

ctgtggcagc gccccctggt gagcatcaag gtggccggcc agaccaagga ggccctgctg 240 

gacaccggcg ccgacgacac cgtgctggag gagatcaacc tgcccggcaa gtggaagccc 300 

aagatgatcg gcggcatcgg cggcttcatc aaggtgcgcc agtacgacca gatcctgatc 360 

gagatctgcg gcaagcgcgc catcggcacc gtgctggtgg gccccacccc cgtgaacatc 420 

atcggccgca acctgctgac ccagctgggc tgcaccctga acttccccat cagccccatc 480 

gagaccgtgc ccgtgaagct gaagcccggc atggacggcc ccaaggtgaa gcagtggccc 540 

ctgaccgagg agaagatcaa ggccctgacc gagatctgcg aggagatgga gaaggagggc 600 

aagatcacca agatcggccc cgagaacccc tacaacaccc ccgtgttcgc catcaagaag 660 

aaggacagca ccaagtggcg caagctggtg gacttccgcg agctgaacaa gcgcacccag 720 

gacttctggg aggtgcagct gggcatcccc caccccgccg gcctgaagaa gaagaagagc 780 

gtgaccgtgc tggacgtggg cgacgcctac ttcagcgtgc ccctggacga gagcttccgc 840 

aagtacaccg ccttcaccat ccccagcatc aacaacgaga cccccggcat ccgctaccag 900 

tacaacgtgc tgccccaggg ctggaagggc agccccgcca tcttccagag cagcatgacc 960 

cgcatcctgg agcccttccg cacccagaac cccgaggtgg tgatctacca gtacatggac 1020 

gacctgtacg tgggcagcga cctggagatc ggccagcacc gcgccaagat cgaggagctg 1080

cgcggccacc tgctgaagtg gggcttcacc acccccgaca agaagcacca gaaggagccc 1140

cccttcctgt ggatgggcta cgagctgcac cccgacaagt ggaccgtgca gcccatccag 1200 

ctgcccgaga aggagagctg gaccgtgaac gacatccaga agctggtggg caagctgaac 1260 

tgggccagcc agatctaccc cggcatcaag gtgcgccagc tgtgcaagct gctgcgcggc 1320 

gccaaggccc tgaccgacat cgtgcccctg accgaggagg ccgagctgga gctggccgag 1380 

aaccgcgaga tcctgaagga gcccgtgcac ggcgtgtact acgaccccag caaggacctg 1440 

atcgccgaga tccagaagca gggcaacgac cagtggacct accagatcta ccaggagccc 1500 

ttcaagaacc tgcgcaccgg caagtacgcc aagatgcgca ccgcccacac caacgacgtg 1560 

aagcagctgg ccgaggccgt gcagaagatc acccaggaga gcatcgtgat ctggggcaag 1620 

acccccaagt tccgcctgcc catccccaag gagacctggg agacctggtg gagcgactac 1680 

tggcaggcca cctggatccc cgagtgggag ttcgtgaaca ccccccccct ggtgaagctg 1740 

tggtaccagc tggagaagga gcccatcgtg ggcgccgaga ccttctacgt ggacggcgcc 1800 

gccaaccgcg agaccaagat cggcaaggcc ggctacgtga ccgacaaggg ccgccagaag 1860 

gtggtgagct tcaccgagac caccaaccag aagaccgagc tgcaggccat ccagctggcc 1920 

ctgcaggaca gcggccccga ggtgaacatc gtgaccgaca gccagtacgc cctgggcatc 1980 

atccaggccc agcccgacaa gagcgagagc gagctggtga gccagatcat cgagcagctg 2040 

atcaagaagg agaaggtgta cctgagctgg gtgcccgccc acaagggcat cggcggcaac 2100 

gagcaggtgg acaagctggt gagcagcggc atccgcaagg tgctgttcct ggacggcatc 2160 

gacaaggccc aggaggagca cgagaagtac cacagcaact ggcgcgccat ggccagcgag 2220 

ttcaacctgc cccccatcgt ggccaaggag atcgtggcca gctgcgacaa gtgccagctg 2280 

aagggcgagg ccatgcacgg ccaggtggac tgcagccccg gcatctggca gctggactgc 2340 

acccacctgg agggcaagat catcctggtg gccgtgcacg tggccagcgg ctacatggag 2400 

gccgaggtga tccccgccga gaccggccag gagaccgcct acttcatcct gaagctggcc 2460 

ggccgctggc ccgtgaaggt gatccacacc gacaacggca gcaacttcac cagcaccgcc 2520 

gtgaaggccg cctgctggtg ggccgacatc cagcgcgagt tcggcatccc ctacaacccc 2580 

cagagccagg gcgtggtgga gagcatgaac aaggagctga agaagatcat cggccaggtg 2640 

cgcgaccagg ccgagcacct gaagaccgcc gtgcagatgg ccgtgttcat ccacaacttc 2700 

aagcgcaagg gcggcatcgg cggctacagc gccggcgagc gcatcatcga catcatcgcc 2760 

agcgacatcc agaccaagga gctgcagaag cagatcatca agatccagaa cttccgcgtg 2820 
tactaccgcg acagccgcga ccccatctgg aagggccccg ccaagctgct gtggaagggc 2880

gagggcgccg tggtgatcca ggacaacagc gacatcaagg tggtgccccg ccgcaaggcc 2940 

aagatcatca aggactacgg caagcagatg gccggcgccg actgcgtggc cggccgccag 3000 

gacgaggac 3009 

<210> 104 
<211> 3009 
<212> DNA 
<213> Artificial 

<220> 

<223> Pol_TV2_C_ZAwt
<400> 104 

ttttttaggg aaaatttggc cttcccacaa ggggaggcca gggaatttcc ttcagagcag 60 

accagagcca acagccccac cactagaacc aacagcccca ccagcagaga gcttcaagtt 120 

caaggagact ccgaagcagg agccgaaaga cagggaacct ttaacttccc tcaaatcact 180 

ctttggcagc gaccccttgt ctcaataaaa gtagcgggcc aaacaaagga ggctctttta 240 

gatacaggag cagatgatac agtactagaa gaaataaact tgccaggaaa atggaaacca 300 

aaaatgatag gaggaattgg aggttttatc aaagtaagac agtatgatca aatacttata 360 

gaaatttgtg gaaaaagggc tataggtaca gtattagtag gacctacacc tgtcaacata 420 

attggaagaa atctgttgac tcagcttgga tgcacactaa attttccaat tagccccatt 480 

gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaggttaa acaatggcca 540 

ttgacagaag aaaaaataaa agcattaaca gaaatttgtg aggaaatgga gaaggaagga 600 

aaaattacaa aaattgggcc tgaaaatcca tataacactc cagtatttgc cataaagaag 660 

aaggacagta caaagtggag aaaattagta gatttcaggg aactcaataa aagaactcaa 720 

gacttttggg aagtccaatt aggaatacca cacccagcag ggttaaaaaa gaaaaaatca 780 

gtgacagtac tggatgtggg agatgcatat ttttcagtcc ctttagatga gagcttcaga 840 

aaatatactg cattcaccat acctagtata aacaatgaaa caccagggat tagatatcaa 900 

tataatgttc ttccacaggg atggaaagga tcaccagcaa tattccagag tagcatgaca 960 

agaatcttag agccctttag aacacaaaac ccagaagtag ttatctatca atatatggat 1020 

gacttatatg taggatctga cttagaaata gggcaacata gagcaaaaat agaggagtta 1080 

agaggacacc tattgaaatg gggatttacc acaccagaca agaaacatca gaaagaaccc 1140 
ccatttcttt ggatggggta tgaactccat cctgacaaat ggacagtaca gcctatacag 1200

ctgccagaaa aggagagctg gactgtcaat gatatacaga agttagtggg aaagttaaac 1260

tgggcaagtc agatttaccc agggattaaa gtaaggcaac tgtgtaaact ccttagggga 1320 

gccaaagcac taacagacat agtgccactg actgaagaag cagaattaga attggctgag 1380 

aacagggaaa ttctaaaaga accagtacat ggagtatatt atgacccatc aaaagattta 1440 

atagctgaaa tacagaaaca ggggaatgac caatggacat atcaaattta ccaagaacca 1500 

tttaaaaatc tgagaacagg aaagtatgca aaaatgagga ctgcccacac taatgatgtg 1560 

aaacagttag cagaggcagt gcaaaagata acccaggaaa gcatagtaat atggggaaaa 1620 

actcctaaat ttagactacc catcccaaaa gaaacatggg agacatggtg gtcagactat 1680 

tggcaagcca cctggattcc tgagtgggag tttgtcaata cccctcccct agtaaaattg 1740 

tggtaccagc tggaaaaaga acccatagta ggggcagaaa ctttctatgt agatggagca 1800 

gccaataggg aaactaaaat aggaaaagca gggtatgtca ctgacaaagg aaggcagaaa 1860 

gttgtttcct tcactgaaac aacaaatcag aagactgaat tacaagcaat tcagctagct 1920 

ttgcaggatt cagggccaga agtaaacata gtaacagact cacagtatgc attaggaatc 1980 

attcaagcac aaccagataa gagtgaatca gaattagtca gtcaaataat agaacagttg 2040 

ataaaaaagg aaaaagtcta cctatcatgg gtaccagcac ataaaggaat tggaggaaat 2100 

gaacaagtag acaaattagt aagtagtgga atcagaaaag tactgtttct agatggaata 2160 

gataaagctc aagaagagca tgaaaaatat cacagcaatt ggagagcaat ggctagtgag 2220 

tttaatctgc cacccatagt agcaaaggaa atagtagcca gctgtgataa atgtcagcta 2280 

aaaggggaag ccatgcatgg acaagtcgac tgtagtccag gaatatggca attagactgt 2340 

acacatttag aaggaaaaat catcctagta gcagtccatg tagccagtgg ctacatggaa 2400 

gcagaggtta tcccagcaga aacaggacaa gaaacagcat actttatact aaaattagca 2460 

ggaagatggc cagtcaaagt aatacataca gataatggca gtaatttcac cagtaccgca 2520 

gttaaggcag cctgttggtg ggcagatatc caacgggaat ttggaattcc ctacaatccc 2580 

caaagtcaag gagtagtaga atccatgaat aaagaattaa agaaaatcat agggcaagta 2640 

agagatcaag ctgagcacct taagacagca gtacaaatgg cagtattcat tcacaatttt 2700 

aaaagaaaag gggggattgg ggggtacagt gcaggggaga gaataataga cataatagca 2760 

tcagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa ttttcgggtt 2820 

tattacagag acagcagaga ccctatttgg aaaggaccag ccaaactact ctggaaaggt 2880 
gaaggggcag tagtaataca agataatagt gatataaagg tagtaccaag aaggaaagca 2940
aaaatcatta aggactatgg aaaacagatg gcaggtgctg attgtgtggc aggtagacag 3000
gatgaagat 3009



<210> 105 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> RevExon1_TV2_C_ZAopt
<400> 105 

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggccatcaa gatcatcaag 60
atcctgtacc agagc 75



<210> 106 

<211> 76 

<212> DNA 

<213> Artificial 

<220> 

<223> RevExon1_TV2_C_ZAwt
<400> 106 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagcaataaa gatcatcaag 60
atcctctacc aaagca 76



<210> 107 

<211> 246 

<212> DNA 

<213> Artificial 

<220> 

<223> RevExon2_TV2_C_ZAopt
<400> 107 

ccctacccca agcccgaggg cacccgccag gcccgccgca accgccgccg ccgctggcgc 60
gcccgccagc agcagatcca cagcatcagc gagcgcatcc tggacacctg cctgggccgc 120
cccaccaagc ccgtgcccct gctgctgccc cccatcgagc gcctgcacat caactgcagc 180
gagagcagcg gcaccagcgg cacccagtag agccagggca ccgccgaggg cgtgggcaac 240
ccctaa 246



<210> 108 
<211> 248
<212> DNA 
<213> Artificial 



<220> 

<223> RevExon2_TV2_C_ZAwt



<400> 108

acccttatcc caaacccgag gggacccgac aggctcggag gaatcgaaga agaaggtgga 60
gagcaagaca gcagcagatc cattcgatta gtgagcggat tcttgacact tgcctgggac 120
gacctacgaa gcctgtgcct cttctgctac caccgattga gagacttcat attaattgta 180
gtgagagcag tggaacttct gggacacagt agtctcaggg gactgcagag ggggtgggga 240
acccttaa 248



<210> 109

<211> 216

<212> DNA

<213> Artificial

<220>

<223> TatExon1_TV2_C_ZAopt
<400> 109

atggagccca tcgaccccaa cctggagccc tggaaccacc ccggcagcca gcccaagacc 60
gcctgcaacg gctgctactg caagcgctgc agctaccact gcctggtgtg cttccagaag 120
aagggcctgg gcatctacta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcaacaagg accaccagga ccccctgccc aagcag 216



<210> 110

<211> 216

<212> DNA

<213> Artificial

<220>

<223> TatExon1_TV2_C_ZAwt
<400> 110

atggagccaa tagatcctaa cctagaaccc tggaaccatc caggaagtca gcctaaaact 60
gcttgtaatg ggtgttactg taaacgttgc agctatcatt gtctagtttg ctttcagaaa 120
aaaggcttag gcatttacta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180
agcaataaag atcatcaaga tcctctacca aagcag 216



<210> 111

<211> 90

<212> DNA 

<213> Artificial 

<220> 

<223> TatExon2_TV2_C_ZAopt
<400> 111 

cccctgagcc agacccgcgg cgaccccacc ggcagcgagg agagcaagaa gaaggtggag 60
agcaagaccg ccgccgaccc cttcgactag 90



<210> 112 

<211> 90 

<212> DNA 

<213> Artificial 

<220> 

<223> TatExon2_TV2_C_ZAwt
<400> 112 

cccttatccc aaacccgagg ggacccgaca ggctcggagg aatcgaagaa gaaggtggag 60
agcaagacag cagcagatcc attcgattag 90

<210> 113 

<211> 579 

<212> DNA 

<213> Artificial 

<220> 

<223> Vif_TV2_C_ZAopt
<400> 113 

atggagaacc gctggcaggt gctgatcgtg tggcaggtgg accgcatgaa gatccgcacc 60
tggcacagcc tggtgaagca ccacatgtac gtgagccgcc gcgccgacgg ctggttctac 120
cgccaccact acgagagccg ccaccccaag gtgagcagcg aggtgcacat ccccctgggc 180
gacgcccgcc tggtgatcaa gacctactgg ggcctgcaga ccggcgagcg cgcctggcac 240
ctgggccacg gcgtgagcat cgagtggcgc ctgcgccgct acagcaccca ggtggacccc 300
gacctgaccg accagctgat ccacatgcac tacttcgact gcttcgccga gagcgccatc 360
cgcaaggcca tcctgggcca gatcgtgagc cccaagtgcg actaccaggc cggccacaac 420
aaggtgggca gcctgcagta cctggccctg accgccctga tcaagcccaa gaagatcaag 480
ccccccctgc ccagcgtgcg caagctggtg gaggaccgct ggaacaagcc ccagaagacc 540
cgcggccgcc gcggcaacca caccatgaac ggccactag 579

<210> 114 
<211> 579
<212> DNA 
<213> Artificial 

<220> 

<223> Vif_TV2_C_ZAwt
<400> 114 

atggaaaaca gatggcaggt gctgattgtg tggcaggtag acaggatgaa gattagaaca 60 

tggcacagtt tagtaaagca ccatatgtat gtttcgagga gagctgatgg atggttctac 120 

agacatcatt atgaaagcag acacccaaaa gtaagttcag aagtacacat cccattagga 180 

gatgccaggt tagtaataaa aacatattgg ggtctgcaga caggagaaag agcttggcat 240 

ttgggtcacg gagtctccat agaatggaga ttgagaagat atagcacaca agtagaccct 300 

gacctgacag accaactaat tcatatgcat tattttgatt gttttgcaga atctgccata 360 

aggaaagcca tactaggaca gatagttagc cctaagtgtg actatcaagc aggacataac 420 

aaggtaggat ctctacaata cttggcactg acagcattga taaaaccaaa aaagataaag 480 

ccacctctgc ctagtgttag gaaattagta gaggatagat ggaacaagcc ccagaagacc 540 

aggggccgca gagggaacca tacaatgaat ggacactag 579 

<210> 115 

<211> 288 

<212> DNA 

<213> Artificial 

<220> 

<223> Vpr_TV2_C_ZAopt
<400> 115 

atggagcagg cccccgagga ccagggcccc cagcgcgagc cctacaacga gtggaccctg 60 
gagctgatgg aggagctgaa gcaggaggcc gtgcgccact tcccccgccc ctggctgcac 120 
aacctgggcc agcacatcta cgagacctac ggcgacacct ggaccggcgt ggaggccatc 180 
atccgcatcc tgcagcagct gctgttcatc cacttccgca tcggctgcca ccacagccgc 240 
atcggcatcc tgcgccagcg ccgcgcccgc aacggcgcca accgcagc 288 



<210> 116 

<211> 288 

<212> DNA 

<213> Artificial 

<220> 

<223> Vpr_TV2_C_ZAwt
<400> 116

atggaacaag ccccagaaga ccaggggccg cagagggaac catacaatga atggacacta 60
gagcttttag aagaactcaa gcaggaagct gtcagacact ttcctagacc atggctccat 120
aacttaggac aacatatcta tgaaacctat ggagatactt ggacaggagt tgaagcaata 180
ataagaatcc tgcaacaatt actgtttatt catttcagga ttgggtgcca tcatagcaga 240
ataggcattt tgcgacagag aagagcaaga aatggagcca atagatcc 288



<210> 117 

<211> 261 

<212> DNA 

<213> Artificial 

<220> 

<223> Vpu_TV2_C_ZAopt
<400> 117 

atgctggacc tgaccgcccg catcgacagc cgcctgggca tcggcgccct gatcgtggcc 60 
ctgatcatcg ccatcatcgt gtggaccatc gtgtacatcg agtaccgcaa gctggtgcgc 120 
cagcgcaaga tcgactggct ggtgaagcgc atccgcgagc gcgccgagga cagcggcaac 180 
gagagcgagg gcgacaccga ggagctgagc accctggtgg acatgggcca cctgcgcctg 240 
ctggacgcca acgacgtgta a 261 



<210> 118 

<211> 261 

<212> DNA 

<213> Artificial 

<220> 

<223> Vpu_TV2_C_ZAwt
<400> 118 

atgttagatt taactgcaag aatagattct agattaggaa taggagcatt gatagtagca 60 
ctaatcatag caataatagt gtggaccata gtatatatag aatataggaa attggtaagg 120 
caaaggaaaa tagactggtt agttaaaagg attagggaaa gagcagaaga cagtggcaat 180 
gagagcgagg gggatactga agaattatcg acactggtgg atatggggca tcttaggctt 240 
ttggatgcta atgatgtgta a 261 



<210> 119 

<211> 1473 

<212> DNA 

<213> Artificial 
<220>

<223> gp120mod.TV1.delV2
<400> 119 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 

atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 

gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 

aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 

ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 

accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 

aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 

gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 

ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 

ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 

ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 

aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 

cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 

tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 

aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 

tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440 

cgcgtggtgc agcgcgagaa gcgctaactc gag 1473 

<210> 120 
<211> 1986 
<212> DNA

<213> Artificial 

<220> 

<223> gp140mod.TV1.delV2
<400> 120 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 

atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 

gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 

aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 

ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 

accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 

aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 

gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 

ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 

ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 

ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 

aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 

cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260

tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 

aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 

tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440 

cgcgtggtgc agcgcgagaa gcgcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560

ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 

cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 

gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 

tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 

gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 

aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920

gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatctaa 1980 

ctcgag 1986 

<210> 121 
<211> 1986 
<212> DNA 
<213> Artificial 

<220> 

<223> gp140mod.TV1.mut7.delV2
<400> 121 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 

atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 

gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 

aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 

ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 

accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 

aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960

ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 

ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 

ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 

aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 

cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260

tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 

aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 

tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 1440 

agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 

ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 

ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 

cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 

gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 

tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 

gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 

aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920 

gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatctaa 1980 

ctcgag 1986 

<210> 122 
<211> 2397 
<212> DNA 
<213> Artificial 

<220> 

<223> gp160mod.TV1.delV1V2
<400> 122 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gggcgccggc aactgcaaca ccagcaccat cacccaggcc 420 

tgccccaagg tgagcttcga ccccatcccc atccactact gcgcccccgc cggctacgcc 480 

atcctgaagt gcaacaacaa gaccttcaac ggcaccggcc cctgctacaa cgtgagcacc 540 

gtgcagtgca cccacggcat caagcccgtg gtgagcaccc agctgctgct gaacggcagc 600 

ctggccgagg agggcatcat catccgcagc gagaacctga ccgagaacac caagaccatc 660 

atcgtgcacc tgaacgagag cgtggagatc aactgcaccc gccccaacaa caacacccgc 720 

aagagcgtgc gcatcggccc cggccaggcc ttctacgcca ccaacgacgt gatcggcaac 780 

atccgccagg cccactgcaa catcagcacc gaccgctgga acaagaccct gcagcaggtg 840 

atgaagaagc tgggcgagca cttccccaac aagaccatcc agttcaagcc ccacgccggc 900 

ggcgacctgg agatcaccat gcacagcttc aactgccgcg gcgagttctt ctactgcaac 960 

accagcaacc tgttcaacag cacctaccac agcaacaacg gcacctacaa gtacaacggc 1020 

aacagcagca gccccatcac cctgcagtgc aagatcaagc agatcgtgcg catgtggcag 1080 

ggcgtgggcc aggccaccta cgcccccccc atcgccggca acatcacctg ccgcagcaac 1140 

atcaccggca tcctgctgac ccgcgacggc ggcttcaaca ccaccaacaa caccgagacc 1200

ttccgccccg gcggcggcga catgcgcgac aactggcgca gcgagctgta caagtacaag 1260

gtggtggaga tcaagcccct gggcatcgcc cccaccaagg ccaagcgccg cgtggtgcag 1320 

cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1380 

agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgccagct gctgagcggc 1440 

atcgtgcagc agcagagcaa cctgctgaag gccatcgagg cccagcagca catgctgcag 1500 

ctgaccgtgt ggggcatcaa gcagctgcag gcccgcgtgc tggccatcga gcgctacctg 1560 

aaggaccagc agctgctggg catctggggc tgcagcggcc gcctgatctg caccaccgcc 1620 

gtgccctgga acagcagctg gagcaacaag agcgagaagg acatctggga caacatgacc 1680 

tggatgcagt gggaccgcga gatcagcaac tacaccggcc tgatctacaa cctgctggag 1740 

gacagccaga accagcagga gaagaacgag aaggacctgc tggagctgga caagtggaac 1800 

aacctgtgga actggttcga catcagcaac tggccctggt acatcaagat cttcatcatg 1860 

atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 1920

gtgcgccagg gctacagccc cctgagcttc cagaccctga cccccagccc ccgcggcctg 1980 
gaccgcctgg gcggcatcga ggaggagggc ggcgagcagg accgcgaccg cagcatccgc 2040

ctggtgagcg gcttcctgag cctggcctgg gacgacctgc gcaacctgtg cctgttcagc 2100 

taccaccgcc tgcgcgactt catcctgatc gccgtgcgcg ccgtggagct gctgggccac 2160 

agcagcctgc gcggcctgca gcgcggctgg gagatcctga agtacctggg cagcctggtg 2220 

cagtactggg gcctggagct gaagaagagc gccatcagcc tgctggacac catcgccatc 2280 

accgtggccg agggcaccga ccgcatcatc gagctggtgc agcgcatctg ccgcgccatc 2340 

ctgaacatcc cccgccgcat ccgccagggc ttcgaggccg ccctgctgta actcgag 2397 

<210> 123 
<211> 2529 
<212> DNA 
<213> Artificial 

<220> 

<223> gp160mod.TV1.delV2
<400> 123 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 

atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 

gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 

aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720

ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 

accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 

aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 

gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020

ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 

ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 

aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 

cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 

tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 

aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 

tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440 

cgcgtggtgc agcgcgagaa gcgcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 

ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 

ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620

cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 

gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 

tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 

gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 

aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920

gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatcaag 1980 

atcttcatca tgatcgtggg cggcctgatc ggcctgcgca tcatcttcgc cgtgctgagc 2040 

atcgtgaacc gcgtgcgcca gggctacagc cccctgagct tccagaccct gacccccagc 2100 

ccccgcggcc tggaccgcct gggcggcatc gaggaggagg gcggcgagca ggaccgcgac 2160 

cgcagcatcc gcctggtgag cggcttcctg agcctggcct gggacgacct gcgcaacctg 2220

tgcctgttca gctaccaccg cctgcgcgac ttcatcctga tcgccgtgcg cgccgtggag 2280 

ctgctgggcc acagcagcct gcgcggcctg cagcgcggct gggagatcct gaagtacctg 2340 

ggcagcctgg tgcagtactg gggcctggag ctgaagaaga gcgccatcag cctgctggac 2400 

accatcgcca tcaccgtggc cgagggcacc gaccgcatca tcgagctggt gcagcgcatc 2460 

tgccgcgcca tcctgaacat cccccgccgc atccgccagg gcttcgaggc cgccctgctg 2520 

taactcgag 2529 

<210> 124 
<211> 2529 
<212> DNA

<213> Artificial 

<220> 

<223> gp160mod.TV1.mut7.delV2
<400> 124 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 

atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 

gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 

aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 

ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 

accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 

aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 

gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 

ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 

ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 

ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 

aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 

cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 

tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 

aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 

tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 1440 

agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560

ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620

cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 

gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 

tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 

gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 

aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920

gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatcaag 1980 

atcttcatca tgatcgtggg cggcctgatc ggcctgcgca tcatcttcgc cgtgctgagc 2040 

atcgtgaacc gcgtgcgcca gggctacagc cccctgagct tccagaccct gacccccagc 2100 

ccccgcggcc tggaccgcct gggcggcatc gaggaggagg gcggcgagca ggaccgcgac 2160 

cgcagcatcc gcctggtgag cggcttcctg agcctggcct gggacgacct gcgcaacctg 2220 

tgcctgttca gctaccaccg cctgcgcgac ttcatcctga tcgccgtgcg cgccgtggag 2280 

ctgctgggcc acagcagcct gcgcggcctg cagcgcggct gggagatcct gaagtacctg 2340 

ggcagcctgg tgcagtactg gggcctggag ctgaagaaga gcgccatcag cctgctggac 2400 

accatcgcca tcaccgtggc cgagggcacc gaccgcatca tcgagctggt gcagcgcatc 2460 

tgccgcgcca tcctgaacat cccccgccgc atccgccagg gcttcgaggc cgccctgctg 2520 

taactcgag 2529 

<210> 125 
<211> 2613 
<212> DNA 
<213> Artificial 

<220> 

<223> gp160mod.TV1.tpa1
<400> 125 

gtcgacgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga 60 

gcagtcttcg tttcgcccag cgccagcacc gaggacctgt gggtgaccgt gtactacggc 120

gtgcccgtgt ggcgcgacgc caagaccacc ctgttctgcg ccagcgacgc caaggcctac 180 

gagaccgagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240 

caggagatcg tgctgggcaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggcc 300 

gaccagatgc acgaggacgt gatcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360

ctgacccccc tgtgcgtgac cctgaactgc accgacacca acgtgaccgg caaccgcacc 420

gtgaccggca acagcaccaa caacaccaac ggcaccggca tctacaacat cgaggagatg 480 

aagaactgca gcttcaacgc caccaccgag ctgcgcgaca agaagcacaa ggagtacgcc 540 

ctgttctacc gcctggacat cgtgcccctg aacgagaaca gcgacaactt cacctaccgc 600 

ctgatcaact gcaacaccag caccatcacc caggcctgcc ccaaggtgag cttcgacccc 660 

atccccatcc actactgcgc ccccgccggc tacgccatcc tgaagtgcaa caacaagacc 720 

ttcaacggca ccggcccctg ctacaacgtg agcaccgtgc agtgcaccca cggcatcaag 780 

cccgtggtga gcacccagct gctgctgaac ggcagcctgg ccgaggaggg catcatcatc 840 

cgcagcgaga acctgaccga gaacaccaag accatcatcg tgcacctgaa cgagagcgtg 900 

gagatcaact gcacccgccc caacaacaac acccgcaaga gcgtgcgcat cggccccggc 960 

caggccttct acgccaccaa cgacgtgatc ggcaacatcc gccaggccca ctgcaacatc 1020 

agcaccgacc gctggaacaa gaccctgcag caggtgatga agaagctggg cgagcacttc 1080 

cccaacaaga ccatccagtt caagccccac gccggcggcg acctggagat caccatgcac 1140 

agcttcaact gccgcggcga gttcttctac tgcaacacca gcaacctgtt caacagcacc 1200 

taccacagca acaacggcac ctacaagtac aacggcaaca gcagcagccc catcaccctg 1260 

cagtgcaaga tcaagcagat cgtgcgcatg tggcagggcg tgggccaggc cacctacgcc 1320 

ccccccatcg ccggcaacat cacctgccgc agcaacatca ccggcatcct gctgacccgc 1380 

gacggcggct tcaacaccac caacaacacc gagaccttcc gccccggcgg cggcgacatg 1440 

cgcgacaact ggcgcagcga gctgtacaag tacaaggtgg tggagatcaa gcccctgggc 1500 

atcgccccca ccaaggccaa gcgccgcgtg gtgcagcgcg agaagcgcgc cgtgggcatc 1560 

ggcgccgtgt tcctgggctt cctgggcgcc gccggcagca ccatgggcgc cgccagcatc 1620 

accctgaccg tgcaggcccg ccagctgctg agcggcatcg tgcagcagca gagcaacctg 1680 

ctgaaggcca tcgaggccca gcagcacatg ctgcagctga ccgtgtgggg catcaagcag 1740 

ctgcaggccc gcgtgctggc catcgagcgc tacctgaagg accagcagct gctgggcatc 1800 

tggggctgca gcggccgcct gatctgcacc accgccgtgc cctggaacag cagctggagc 1860 

aacaagagcg agaaggacat ctgggacaac atgacctgga tgcagtggga ccgcgagatc 1920 

agcaactaca ccggcctgat ctacaacctg ctggaggaca gccagaacca gcaggagaag 1980 

aacgagaagg acctgctgga gctggacaag tggaacaacc tgtggaactg gttcgacatc 2040 

agcaactggc cctggtacat caagatcttc atcatgatcg tgggcggcct gatcggcctg 2100 

cgcatcatct tcgccgtgct gagcatcgtg aaccgcgtgc gccagggcta cagccccctg 2160 

agcttccaga ccctgacccc cagcccccgc ggcctggacc gcctgggcgg catcgaggag 2220 

gagggcggcg agcaggaccg cgaccgcagc atccgcctgg tgagcggctt cctgagcctg 2280

gcctgggacg acctgcgcaa cctgtgcctg ttcagctacc accgcctgcg cgacttcatc 2340 

ctgatcgccg tgcgcgccgt ggagctgctg ggccacagca gcctgcgcgg cctgcagcgc 2400 

ggctgggaga tcctgaagta cctgggcagc ctggtgcagt actggggcct ggagctgaag 2460 

aagagcgcca tcagcctgct ggacaccatc gccatcaccg tggccgaggg caccgaccgc 2520 

atcatcgagc tggtgcagcg catctgccgc gccatcctga acatcccccg ccgcatccgc 2580 

cagggcttcg aggccgccct gctgtaactc gag 2613 

<210> 126 
<211> 2616 
<212> DNA 
<213> Artificial 

<220> 

<223> gp160mod.TV1
<400> 126 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 

gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 

cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 

cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 

accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 

aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 
atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900

gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 

ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 

atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 

ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 

cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200

acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260

ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320

gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 

cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 

atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 

ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 

atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 

ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 

cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 

atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 

agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 

atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 

aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 

atcagcaact ggccctggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100 

ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160 

ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220 

gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280

ctggcctggg acgacctgcg caacctgtgc ctgttcagct accaccgcct gcgcgacttc 2340 

atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400 

cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460 

aagaagagcg ccatcagcct gctggacacc atcgccatca ccgtggccga gggcaccgac 2520 
cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580

cgccagggct tcgaggccgc cctgctgtaa ctcgag 2616



<210> 127 
<211> 2616 
<212> DNA 
<213> Artificial 

<220> 

<223> gp160mod.TV1.wtLnative
<400> 127 

gaattcatga gagtgatggg gacacagaag aattgtcaac aatggtggat atggggcatc 60 

ttaggcttct ggatgctaat gatttgtaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 

gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 

cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 

cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 

accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 

aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 

atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 

gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 

ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 

atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 

ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 

cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 

acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 

ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 
gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380

cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440

atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500

ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620

atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680

ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740

cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800

atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860

agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920

atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980

aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040

atcagcaact ggccctggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100

ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160

ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220

gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280

ctggcctggg acgacctgcg caacctgtgc ctgttcagct accaccgcct gcgcgacttc 2340

atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400

cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460

aagaagagcg ccatcagcct gctggacacc atcgccatca ccgtggccga gggcaccgac 2520

cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580

cgccagggct tcgaggccgc cctgctgtaa ctcgag 2616



<210> 128 

<211> 2604 

<212> DNA 

<213> Artificial 

<220> 

<223> Wild-type Env gp160 (8_2_ZA)
<400> 128 

atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 60 
ttctggatgc taatgatttg taacacggag gacttgtggg tcacagtcta ctatggggta 120

cctgtgtgga gagacgcaaa aactactcta ttctgtgcat cagatgctaa agcatatgag 180 

acagaagtgc ataatgtctg ggctacacat gcctgtgtac ccacagaccc caacccacaa 240 

gaaatagttt tgggaaatgt aacagaaaat tttaatatgt ggaaaaatga catggcagat 300 

cagatgcatg aggatgtaat cagtttatgg gatcaaagcc taaagccatg tgtaaagttg 360 

accccactct gtgtcacttt aaactgtaca gatacaaatg ttacaggtaa tagaactgtt 420 

acaggtaata gtaccaataa tacaaatggt acaggtattt ataacattga agaaatgaaa 480 

aattgctctt tcaatgcaac cacagaatta agagataaga aacataaaga gtatgcactc 540 

ttttatagac ttgatatagt accacttaat gagaatagtg acaactttac atatagatta 600 

ataaattgca atacctcaac cataacacaa gcctgtccaa aggtctcttt tgacccgatt 660 

cctatacatt actgtgctcc agctggttat gcgattctaa agtgtaataa taagacattc 720 

aatgggacag gaccatgtta taatgtcagc acagtacaat gtacacatgg aattaagcca 780 

gtggtatcaa ctcaattact gttaaatggt agtctagcag aagaagggat aataattaga 840 

tctgaaaatt tgacagagaa taccaaaaca ataatagtac accttaatga atctgtagag 900 

attaattgta caagacccaa caataataca agaaaaagtg taaggatagg accaggacaa 960 

gcattctatg caacaaatga tgtaatagga aacataagac aagcacattg taacattagt 1020 

acagatagat ggaacaaaac tttacaacag gtaatgaaaa aattaggaga gcatttccct 1080 

aataaaacaa tacaatttaa accacatgca ggaggggatc tagaaattac aatgcatagc 1140 

tttaattgta gaggagaatt tttctattgt aatacatcaa acctgtttaa tagcacatac 1200 

cactctaata atggtacata caaatacaat ggtaattcaa gctcacccat cacactccaa 1260 

tgtaaaataa aacaaattgt acgcatgtgg caaggggtag gacaagcaac gtatgcccct 1320 

cccattgcag gaaacataac atgtagatca aacatcacag gaatactatt gacacgtgat 1380 

ggaggattta acaccacaaa caacacagag acattcagac ctggaggagg agatatgagg 1440 

gataactgga gaagtgaatt atataaatat aaagtagtag aaattaagcc attgggaata 1500 

gcacccacta aggcaaaaag aagagtggtg cagagagaaa aaagagcagt gggaatagga 1560 

gctgtgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaataacg 1620

ctgacggtac aggccagaca actgttgtct ggtatagtgc aacagcaaag caatttgctg 1680 

aaggctatag aggcgcaaca gcatatgttg caactcacag tctggggcat taagcagctc 1740 

caggcgagag tcctggctat agaaagatac ctaaaggatc aacagctcct agggatttgg 1800 
ggctgctctg gaagactcat ctgcaccact gctgtgcctt ggaactccag ttggagtaat 1860

aaatctgaaa aagatatttg ggataacatg acttggatgc agtgggatag agaaattagt 1920

aattacacag gcttaatata caatttgctt gaagactcgc aaaaccagca ggaaaagaat 1980

gaaaaagatt tattagaatt ggacaagtgg aacaatctgt ggaattggtt tgacatatca 2040

aactggccgt ggtatataaa aatattcata atgatagtag gaggcttgat aggtttaaga 2100

ataatttttg ctgtgctttc tatagtgaat agagttaggc agggatactc acctttgtca 2160

tttcagaccc ttaccccaag cccgagggga ctcgacaggc tcggaggaat cgaagaagaa 2220

ggtggagagc aagacagaga cagatccata cgattggtga gcggattctt gtcgcttgcc 2280

tgggacgatc tgcggaacct gtgcctcttc agctaccacc gcttgagaga cttcatatta 2340

attgcagtga gggcagtgga acttctggga cacagcagtc tcaggggact acagaggggg 2400

tgggaaatcc ttaagtatct gggaagtctt gtgcaatatt ggggtctaga gctaaaaaag 2460

agtgctatta gtctgcttga taccatagca ataacagtag ctgaaggaac agataggatt 2520

atagaattag tacaaagaat ttgtagagct atcctcaaca tacctagaag aataagacag 2580

ggctttgaag cagctttgct ataa 2604



<210> 129 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> wild-type amino acid sequence changed by mutation in gp120/gp41
cleavage site 

<400> 129 

Lys Arg Arg Val Val Gln Arg Glu Lys Arg
1 5 10

<210> 130 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> wild-type amino acid sequence changed by mutation in gp120/gp41
cleavage site 

<400> 130 

Ile Ser Ser Val Val Gln Ser Glu Lys Ser
1 5 10



<210> 131 
<211> 2052 
<212> DNA 
<213> Artificial 

<220> 

<223> gp140mod.TV1.tpa1
<400> 131 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 60 

tcgcccagcg ccagcaccga ggacctgtgg gtgaccgtgt actacggcgt gcccgtgtgg 120 

cgcgacgcca agaccaccct gttctgcgcc agcgacgcca aggcctacga gaccgaggtg 180 

cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca ggagatcgtg 240 

ctgggcaacg tgaccgagaa cttcaacatg tggaagaacg acatggccga ccagatgcac 300

gaggacgtga tcagcctgtg ggaccagagc ctgaagccct gcgtgaagct gacccccctg 360 

tgcgtgaccc tgaactgcac cgacaccaac gtgaccggca accgcaccgt gaccggcaac 420 

agcaccaaca acaccaacgg caccggcatc tacaacatcg aggagatgaa gaactgcagc 480 

ttcaacgcca ccaccgagct gcgcgacaag aagcacaagg agtacgccct gttctaccgc 540 

ctggacatcg tgcccctgaa cgagaacagc gacaacttca cctaccgcct gatcaactgc 600 

aacaccagca ccatcaccca ggcctgcccc aaggtgagct tcgaccccat ccccatccac 660 

tactgcgccc ccgccggcta cgccatcctg aagtgcaaca acaagacctt caacggcacc 720 

ggcccctgct acaacgtgag caccgtgcag tgcacccacg gcatcaagcc cgtggtgagc 780 

acccagctgc tgctgaacgg cagcctggcc gaggagggca tcatcatccg cagcgagaac 840 

ctgaccgaga acaccaagac catcatcgtg cacctgaacg agagcgtgga gatcaactgc 900 

acccgcccca acaacaacac ccgcaagagc gtgcgcatcg gccccggcca ggccttctac 960 

gccaccaacg acgtgatcgg caacatccgc caggcccact gcaacatcag caccgaccgc 1020 

tggaacaaga ccctgcagca ggtgatgaag aagctgggcg agcacttccc caacaagacc 1080 

atccagttca agccccacgc cggcggcgac ctggagatca ccatgcacag cttcaactgc 1140 

cgcggcgagt tcttctactg caacaccagc aacctgttca acagcaccta ccacagcaac 1200

aacggcacct acaagtacaa cggcaacagc agcagcccca tcaccctgca gtgcaagatc 1260

aagcagatcg tgcgcatgtg gcagggcgtg ggccaggcca cctacgcccc ccccatcgcc 1320 

ggcaacatca cctgccgcag caacatcacc ggcatcctgc tgacccgcga cggcggcttc 1380 
aacaccacca acaacaccga gaccttccgc cccggcggcg gcgacatgcg cgacaactgg 1440

cgcagcgagc tgtacaagta caaggtggtg gagatcaagc ccctgggcat cgcccccacc 1500 

aaggccaagc gccgcgtggt gcagcgcgag aagcgcgccg tgggcatcgg cgccgtgttc 1560 

ctgggcttcc tgggcgccgc cggcagcacc atgggcgccg ccagcatcac cctgaccgtg 1620 

caggcccgcc agctgctgag cggcatcgtg cagcagcaga gcaacctgct gaaggccatc 1680 

gaggcccagc agcacatgct gcagctgacc gtgtggggca tcaagcagct gcaggcccgc 1740 

gtgctggcca tcgagcgcta cctgaaggac cagcagctgc tgggcatctg gggctgcagc 1800 

ggccgcctga tctgcaccac cgccgtgccc tggaacagca gctggagcaa caagagcgag 1860 

aaggacatct gggacaacat gacctggatg cagtgggacc gcgagatcag caactacacc 1920 

ggcctgatct acaacctgct ggaggacagc cagaaccagc aggagaagaa cgagaaggac 1980 

ctgctggagc tggacaagtg gaacaacctg tggaactggt tcgacatcag caactggccc 2040 

tggtacatct aa 2052 

<210> 132 
<211> 2073 
<212> DNA 
<213> Artificial 

<220> 

<223> gp140mod.TV1
<400> 132 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 

gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 

cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 
cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720

accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 

aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 

atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 

gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 

ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 

atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 

ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 

cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 

acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 

ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 

gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 

cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 

atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 

ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 

atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 

ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 

cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 

atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 

agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 

atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 

aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 

atcagcaact ggccctggta catctaactc gag 2073 

<210> 133 
<211> 2073 
<212> DNA 
<213> Artificial 

<220> 

<223> gp140mod.TV1.wtLnative
<400> 133

gaattcatga gagtgatggg gacacagaag aattgtcaac aatggtggat atggggcatc 60 

ttaggcttct ggatgctaat gatttgtaac accgaggacc tgtgggtgac cgtgtactac 120 

ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 

tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 

ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 

gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 

aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 

accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 

atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 

gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 

cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 

cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 

accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 

aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 

atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 

gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 

ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 

atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 

ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 

cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 

acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 

ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 

gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 

cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 

atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 

ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 

atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 

atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740

cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 

atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 

agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 

atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 

aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 

atcagcaact ggccctggta catctaactc gag 2073 

<210> 134 
<211> 624 
<212> DNA 
<213> Artificial 

<220> 

<223> NefD125G_TV2_C_ZAopt 
<400> 134 

atgggcggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 

cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120

ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 

caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg gctggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 

gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540 

gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 



<210> 135 

<211> 624 

<212> DNA 

<213> Artificial 

<220> 

<223> NefD125G-Myr_TV2_C_ZAopt

<400> 135 

atggccggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 
cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120

ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 

caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg gctggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 

gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540 

gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 

cccgagtact acaaggactg ctga 624 



<210> 136 

<211> 27 

<212> PRT 

<213> Artificial 

<220> 

<223> TV1c8.2 signal peptide leader sequence
<400> 136 

Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys
20 25



<210> 137 

<211> 81 

<212> DNA 

<213> Artificial 

<220> 

<223> WTnative (8-2_TV1_C.ZA) signal peptide leader sequence
<400> 137 

atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 60 
ttctggatgc taatgatttg t 81 



<210> 138 
<211> 81 
<212> DNA 
<213> Artificial

<220>

<223> WTmod (8-2_TV1_C.ZA) signal peptide leader sequence
<400> 138

atgcgcgtga tgggcaccca gaagaactgc cagcagtggt ggatctgggg catcctgggc 60
ttctggatgc tgatgatctg c 81



<210> 139 

<211> 25 

<212> PRT 

<213> Artificial 

<220> 

<223> Tpal signal peptide leader sequence 
<400> 139 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15

Ala Val Phe Val Ser Pro Ser Ala Ser
20 25



<210> 140 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> Tpal signal peptide leader sequence 
<400> 140 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 60 
tcgcccagcg ccagc 75 



<210> 141 

<211> 23 

<212> PRT 

<213> Artificial 

<220> 

<223> Tpa2 signal peptide leader sequence 
<400> 141 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15

Ala Val Phe Val Ser Pro Ser
20



<210> 142 

<211> 69 

<212> DNA 

<213> Artificial 

<220> 

<223> Tpa2 signal peptide leader sequence 
<400> 142 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 60 
tcgcccagc 69 



<210> 143 
<211> 842 
<212> PRT 

<213> Human immunodeficiency virus SF162 
<400> 143 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15

Ala Val Phe Val Ser Pro Ser Ala Val Glu Lys Leu Trp Val Thr Val
20 25 30

Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
35 40 45

Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
50 55 60

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Val Leu
65 70 75 80

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu
85 90 95

Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
100 105 110

Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu His Cys Thr Asn Leu
115 120 125

Lys Asn Ala Thr Asn Thr Lys Ser Ser Asn Trp Lys Glu Met Asp Arg
130 135 140

Gly Glu Ile Lys Asn Cys Ser Phe Lys Val Thr Thr Ser Ile Arg Asn
145 150 155 160

Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro
165 170 175

Ile Asp Asn Asp Asn Thr Ser Tyr Lys Leu Ile Asn Cys Asn Thr Ser
180 185 190

Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
195 200 205

His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
210 215 220

Lys Phe Asn Gly Ser Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
225 230 235 240

Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
245 250 255

Ser Leu Ala Glu Glu Gly Val Val Ile Arg Ser Glu Asn Phe Thr Asp
260 265 270

Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn
275 280 285

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Thr Ile Gly Pro
290 295 300

Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln
305 310 315 320

Ala His Cys Asn Ile Ser Gly Glu Lys Trp Asn Asn Thr Leu Lys Gln
325 330 335

Ile Val Thr Lys Leu Gln Ala Gln Phe Gly Asn Lys Thr Ile Val Phe
340 345 350

Lys Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
355 360 365

Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
370 375 380

Thr Trp Asn Asn Thr Ile Gly Pro Asn Asn Thr Asn Gly Thr Ile Thr
385 390 395 400

Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly
405 410 415

Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser
420 425 430

Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Glu Ile Ser
435 440 445

Asn Thr Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
450 455 460

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
465 470 475 480

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
485 490 495

Arg Ala Val Thr Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala
500 505 510

Gly Ser Thr Met Gly Ala Arg Ser Leu Thr Leu Thr Val Gln Ala Arg
515 520 525

Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala
530 535 540

Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys
545 550 555 560

Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln
565 570 575

Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr
580 585 590

Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Gln Ile
595 600 605

Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Asp Asn Tyr
610 615 620

Thr Asn Leu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu
625 630 635 640

Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp
645 650 655

Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile
660 665 670

Met Ile Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe Thr Val Leu
675 680 685

Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln
690 695 700

Thr Arg Phe Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu
705 710 715 720

Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Ser Pro Leu Val His
725 730 735

Gly Leu Leu Ala Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe
740 745 750

Ser Tyr His Arg Leu Arg Asp Leu Ile Leu Ile Ala Ala Arg Ile Val
755 760 765

Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Gly Asn
770 775 780

Leu Leu Gln Tyr Trp Ile Gln Glu Leu Lys Asn Ser Ala Val Ser Leu
785 790 795 800

Phe Asp Ala Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile
805 810 815

Glu Val Ala Gln Arg Ile Gly Arg Ala Phe Leu His Ile Pro Arg Arg
820 825 830

Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu
835 840



<210> 144 
<211> 867 
<212> PRT 

<213> Human immunodeficiency virus TV1.8_2
<400> 144 

Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Thr Glu Asp Leu
20 25 30

Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Lys Thr
35 40 45

Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val His
50 55 60

Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln
65 70 75 80

Glu Ile Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn
85 90 95

Asp Met Ala Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln
100 105 110

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
115 120 125

Cys Thr Asp Thr Asn Val Thr Gly Asn Arg Thr Val Thr Gly Asn Ser
130 135 140

Thr Asn Asn Thr Asn Gly Thr Gly Ile Tyr Asn Ile Glu Glu Met Lys
145 150 155 160

Asn Cys Ser Phe Asn Ala Thr Thr Glu Leu Arg Asp Lys Lys His Lys
165 170 175

Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val Pro Leu Asn Glu Asn
180 185 190

Ser Asp Asn Phe Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Thr Ile
195 200 205

Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr
210 215 220

Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe
225 230 235 240

Asn Gly Thr Gly Pro Cys Tyr Asn Val Ser Thr Val Gln Cys Thr His
245 250 255

Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
260 265 270

Ala Glu Glu Gly Ile Ile Ile Arg Ser Glu Asn Leu Thr Glu Asn Thr
275 280 285

Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile Asn Cys Thr
290 295 300

Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg Ile Gly Pro Gly Gln
305 310 315 320

Ala Phe Tyr Ala Thr Asn Asp Val Ile Gly Asn Ile Arg Gln Ala His
325 330 335

Cys Asn Ile Ser Thr Asp Arg Trp Asn Lys Thr Leu Gln Gln Val Met
340 345 350

Lys Lys Leu Gly Glu His Phe Pro Asn Lys Thr Ile Gln Phe Lys Pro
355 360 365

His Ala Gly Gly Asp Leu Glu Ile Thr Met His Ser Phe Asn Cys Arg
370 375 380

Gly Glu Phe Phe Tyr Cys Asn Thr Ser Asn Leu Phe Asn Ser Thr Tyr
385 390 395 400

His Ser Asn Asn Gly Thr Tyr Lys Tyr Asn Gly Asn Ser Ser Ser Pro
405 410 415

Ile Thr Leu Gln Cys Lys Ile Lys Gln Ile Val Arg Met Trp Gln Gly
420 425 430

Val Gly Gln Ala Thr Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys
435 440 445

Arg Ser Asn Ile Thr Gly Ile Leu Leu Thr Arg Asp Gly Gly Phe Asn
450 455 460

Thr Thr Asn Asn Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
465 470 475 480

Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys
485 490 495

Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
500 505 510

Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly
515 520 525

Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln
530 535 540

Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu
545 550 555 560

Lys Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu Thr Val Trp Gly
565 570 575

Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys
580 585 590

Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Arg Leu Ile Cys
595 600 605

Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Glu Lys
610 615 620

Asp Ile Trp Asp Asn Met Thr Trp Met Gln Trp Asp Arg Glu Ile Ser
625 630 635 640

Asn Tyr Thr Gly Leu Ile Tyr Asn Leu Leu Glu Asp Ser Gln Asn Gln
645 650 655

Gln Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys Trp Asn Asn
660 665 670

Leu Trp Asn Trp Phe Asp Ile Ser Asn Trp Pro Trp Tyr Ile Lys Ile
675 680 685

Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala
690 695 700

Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser
705 710 715 720

Phe Gln Thr Leu Thr Pro Ser Pro Arg Gly Leu Asp Arg Leu Gly Gly
725 730 735

Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Asp Arg Ser Ile Arg Leu
740 745 750

Val Ser Gly Phe Leu Ser Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys
755 760 765

Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ile Leu Ile Ala Val Arg
770 775 780

Ala Val Glu Leu Leu Gly His Ser Ser Leu Arg Gly Leu Gln Arg Gly
785 790 795 800

Trp Glu Ile Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr Trp Gly Leu
805 810 815

Glu Leu Lys Lys Ser Ala Ile Ser Leu Leu Asp Thr Ile Ala Ile Thr
820 825 830

Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Leu Val Gln Arg Ile Cys
835 840 845

Arg Ala Ile Leu Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Ala
850 855 860

Ala Leu Leu
865



<210> 145 
<211> 869 
<212> PRT 

<213> Human immunodeficiency virus TV1.8_5
<400> 145 

Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp
1 5 10 15



Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Thr Glu Asp Leu
20 25 30 



Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Glu Ala Lys Thr 
35 40 45 



Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val His 
50 55 60 



Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln
65 70 75 80 



Glu Ile Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn
85 90 95 



Asn Met Ala Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
100 105 110 



Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn 
115 120 125 



Cys Thr Asp Thr Asn Val Thr Gly Asn Arg Thr Val Thr Gly Asn Thr 
130 135 140 



Asn Asp Thr Asn Ile Ala Asn Ala Thr Tyr Lys Tyr Glu Glu Met Lys
145 150 155 160 



Asn Cys Ser Phe Asn Ala Thr Thr Glu Leu Arg Asp Lys Lys His Lys 
165 170 175 
Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Leu Asn Glu Asn
180 185 190 



Ser Asn Asn Phe Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Thr Ile
195 200 205 



Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr
210 215 220 



Cys Ala Pro Ala Asp Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe
225 230 235 240 



Asn Gly Thr Gly Pro Cys Tyr Asn Val Ser Thr Val Gln Cys Thr His
245 250 255 



Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
260 265 270 



Ala Glu Glu Gly Ile Ile Ile Arg Ser Glu Asn Leu Thr Glu Asn Thr
275 280 285 



Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile Asn Cys Thr
290 295 300 



Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg Ile Gly Pro Gly Gln
305 310 315 320 



Ala Phe Tyr Ala Thr Asn Asp Val Ile Gly Asn Ile Arg Gln Ala His
325 330 335 



Cys Asn Ile Ser Thr Asp Arg Trp Asn Lys Thr Leu Gln Gln Val Met
340 345 350 



Lys Lys Leu Gly Glu His Phe Pro Asn Lys Thr Ile Lys Phe Glu Pro
355 360 365 



His Ala Gly Gly Asp Leu Glu Ile Thr Met His Ser Phe Asn Cys Arg
370 375 380 



Gly Glu Phe Phe Tyr Cys Asn Thr Ser Asn Leu Phe Asn Ser Thr Tyr 
385 390 395 400 
Tyr Pro Lys Asn Gly Thr Tyr Lys Tyr Asn Gly Asn Ser Ser Leu Pro
405 410 415 



Ile Thr Leu Gln Cys Lys Ile Lys Gln Ile Val Arg Met Trp Gln Gly
420 425 430 



Val Gly Gln Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys
435 440 445 



Arg Ser Asn Ile Thr Gly Ile Leu Leu Thr Arg Asp Gly Gly Phe Asn
450 455 460 



Asn Thr Asn Asn Asp Thr Glu Glu Thr Phe Arg Pro Gly Gly Gly Asp 
465 470 475 480 



Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu 
485 490 495 



Ile Lys Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val
500 505 510 



Gln Arg Lys Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe
515 520 525 



Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr
530 535 540 



Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn
545 550 555 560 



Leu Leu Lys Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu Thr Val
565 570 575 



Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu Arg Tyr
580 585 590 



Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Arg Leu
595 600 605 



Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser
610 615 620 
Glu Ala Asp Ile Trp Asp Asn Met Thr Trp Met Gln Trp Asp Arg Glu
625 630 635 640 



Ile Asn Asn Tyr Thr Glu Thr Ile Phe Arg Leu Leu Glu Asp Ser Gln
645 650 655 



Asn Gln Gln Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys Trp
660 665 670 



Asn Asn Leu Trp Asn Trp Phe Asp Ile Ser Asn Trp Leu Trp Tyr Ile
675 680 685 



Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Ile
690 695 700 



Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro
705 710 715 720 



Leu Ser Phe Gln Thr Leu Thr Pro Ser Pro Arg Gly Leu Asp Arg Leu
725 730 735 



Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Asp Arg Ser Ile
740 745 750 



Arg Leu Val Ser Gly Phe Leu Ser Leu Ala Trp Asp Asp Leu Arg Ser 
755 760 765 



Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ile Leu Ile Ala
770 775 780 



Val Arg Ala Val Glu Leu Leu Gly His Ser Ser Leu Arg Gly Leu Gln
785 790 795 800 



Arg Gly Trp Glu Ile Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr Trp
805 810 815 



Gly Leu Glu Leu Lys Lys Ser Ala Ile Ser Pro Leu Asp Thr Ile Ala
820 825 830 



Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Leu Val Gln Arg
835 840 845 



Ile Cys Arg Ala Ile Leu Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe
850 855 860



Glu Ala Ala Leu Leu 
865 



<210> 146 
<211> 854 
<212> PRT 

<213> Human immunodeficiency virus TV2.12-5/1
<400> 146 

Met Arg Ala Arg Gly Ile Leu Lys Asn Tyr Arg His Trp Trp Ile Trp
1 5 10 15



Gly Ile Leu Gly Phe Trp Met Leu Met Met Cys Asn Val Lys Gly Leu
20 25 30 



Trp Val Thr Val Tyr Tyr Gly Val Pro Val Gly Arg Glu Ala Lys Thr 
35 40 45 



Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His 
50 55 60 



Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln
65 70 75 80 



Glu Val Ile Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn
85 90 95 



Asp Met Val Asp Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp Gln
100 105 110 



Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn 
115 120 125 



Cys Thr Asn Ala Thr Val Asn Tyr Asn Asn Thr Ser Lys Asp Met Lys 
130 135 140 



Asn Cys Ser Phe Tyr Val Thr Thr Glu Leu Arg Asp Lys Lys Lys Lys 
145 150 155 160 



Glu Asn Ala Leu Phe Tyr Arg Leu Asp Ile Val Pro Leu Asn Asn Arg
165 170 175 
Lys Asn Gly Asn Ile Asn Asn Tyr Arg Leu Ile Asn Cys Asn Thr Ser
180 185 190 



Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile
195 200 205 



His Tyr Cys Ala Pro Ala Gly Tyr Ala Pro Leu Lys Cys Asn Asn Lys 
210 215 220 



Lys Phe Asn Gly Ile Gly Pro Cys Asp Asn Val Ser Thr Val Gln Cys
225 230 235 240 



Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
245 250 255 



Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr Asn
260 265 270 



Asn Val Lys Thr Ile Ile Val His Leu Asn Glu Ser Ile Glu Ile Lys
275 280 285 



Cys Thr Arg Pro Gly Asn Asn Thr Arg Lys Ser Val Arg Ile Gly Pro
290 295 300 



Gly Gln Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln
305 310 315 320 



Ala His Cys Asn Ile Ser Lys Asn Glu Trp Asn Thr Thr Leu Gln Arg
325 330 335 



Val Ser Gln Lys Leu Gln Glu Leu Phe Pro Asn Ser Thr Gly Ile Lys
340 345 350 



Phe Ala Pro His Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe
355 360 365 



Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Asp Leu Phe Asn 
370 375 380 



Ser Thr Tyr Ser Asn Gly Thr Cys Thr Asn Gly Thr Cys Met Ser Asn 
385 390 395 400 
Asn Thr Glu Arg Ile Thr Leu Gln Cys Arg Ile Lys Gln Ile Ile Asn
405 410 415 



Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly
420 425 430 



Asn Ile Thr Cys Arg Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
435 440 445 



Gly Gly Asp Asn Asn Thr Glu Thr Glu Thr Phe Arg Pro Gly Gly Gly 
450 455 460 



Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
465 470 475 480 



Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Ala Ala Lys Arg Arg Val
485 490 495 



Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly
500 505 510 



Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu
515 520 525 



Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser
530 535 540 



Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu Thr
545 550 555 560 



Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu Arg
565 570 575 



Tyr Leu Gln Asp Gln Gln Leu Leu Gly Leu Trp Gly Cys Ser Gly Lys
580 585 590 



Leu Ile Cys Thr Thr Asn Val Leu Trp Asn Ser Ser Trp Ser Asn Lys
595 600 605 



Thr Gln Ser Asp Ile Trp Asp Asn Met Thr Trp Met Gln Trp Asp Arg
610 615 620 



Glu Ile Ser Asn Tyr Thr Asn Thr Ile Tyr Arg Leu Leu Glu Asp Ser
625 630 635 640



Gln Ser Gln Gln Glu Arg Asn Glu Lys Asp Leu Leu Ala Leu Asp Arg
645 650 655 



Trp Asn Asn Leu Trp Asn Trp Phe Ser Ile Thr Asn Trp Leu Trp Tyr
660 665 670 



Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile
675 680 685 



Ile Phe Ala Val Leu Ser Leu Val Asn Arg Val Arg Gln Gly Tyr Ser
690 695 700 



Pro Leu Ser Leu Gln Thr Leu Ile Pro Asn Pro Arg Gly Pro Asp Arg
705 710 715 720 



Leu Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln Asp Ser Ser Arg Ser
725 730 735 



Ile Arg Leu Val Ser Gly Phe Leu Thr Leu Ala Trp Asp Asp Leu Arg
740 745 750 



Ser Leu Cys Leu Phe Cys Tyr His Arg Leu Arg Asp Phe Ile Leu Ile
755 760 765 



Val Val Arg Ala Val Glu Leu Leu Gly His Ser Ser Leu Arg Gly Leu 
770 775 780 



Gln Arg Gly Trp Gly Thr Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr
785 790 795 800 



Trp Gly Leu Glu Leu Lys Lys Ser Ala Ile Asn Leu Leu Asp Thr Ile
805 810 815 



Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Leu Glu Phe Ile Gln
820 825 830 



Asn Leu Cys Arg Gly Ile Arg Asn Val Pro Arg Arg Ile Arg Gln Gly
835 840 845 



Phe Glu Ala Ala Leu Gln
850 
<210> 147

<211> 875 

<212> PRT 

<213> Artificial 

<220> 

<223> HIV Env consensus sequence 



<220>
<221> misc_feature
<222> (148)..(149)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (151)..(151)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (153)..(153)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (155)..(156)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (192)..(192)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (366)..(366)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (406)..(409)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (421)..(421)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (475)..(476)
<223> Xaa can be any naturally occurring amino acid

<400> 147 
Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp
1 5 10 15



Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Val Glu Asp Leu
20 25 30 



Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Glu Ala Lys Thr 
35 40 45 



Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val His 
50 55 60 



Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln
65 70 75 80 



Glu Ile Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn
85 90 95 



Asn Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
100 105 110 



Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn 
115 120 125 



Cys Thr Asn Thr Asn Val Thr Gly Asn Arg Thr Val Thr Gly Asn Ser 
130 135 140 



Asn Ser Asn Xaa Xaa Ala Xaa Ala Xaa Tyr Xaa Xaa Glu Glu Met Lys 
145 150 155 160 



Asn Cys Ser Phe Asn Val Thr Thr Glu Leu Arg Asp Lys Lys His Lys 
165 170 175 



Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Leu Asn Asn Xaa
180 185 190 



Glu Asn Ser Asn Asn Phe Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
195 200 205 



Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile
210 215 220 
His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys
225 230 235 240 



Thr Phe Asn Gly Thr Gly Pro Cys Tyr Asn Val Ser Thr Val Gln Cys
245 250 255 



Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
260 265 270 



Ser Leu Ala Glu Glu Gly Ile Ile Ile Arg Ser Glu Asn Leu Thr Glu
275 280 285 



Asn Thr Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile Asn
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg Ile Gly Pro
305 310 315 320 



Gly Gln Ala Phe Tyr Ala Thr Asn Asp Ile Ile Gly Asn Ile Arg Gln
325 330 335 



Ala His Cys Asn Ile Ser Thr Asp Arg Trp Asn Lys Thr Leu Gln Gln
340 345 350 



Val Met Lys Lys Leu Gln Glu His Phe Pro Asn Lys Thr Xaa Ile Lys
355 360 365 



Phe Lys Pro His Ala Gly Gly Asp Leu Glu Ile Thr Met His Ser Phe
370 375 380 



Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Asn Leu Phe Asn 
385 390 395 400 



Ser Thr Tyr His Asn Xaa Xaa Xaa Xaa Asn Gly Thr Tyr Lys Tyr Asn 
405 410 415 



Gly Asn Ser Ser Xaa Pro Ile Thr Leu Gln Cys Lys Ile Lys Gln Ile
420 425 430 



Ile Arg Met Trp Gln Gly Val Gly Gln Ala Met Tyr Ala Pro Pro Ile
435 440 445 



Ala Gly Asn Ile Thr Cys Arg Ser Asn Ile Thr Gly Ile Leu Leu Thr
450 455 460



Arg Asp Gly Gly Phe Asn Asn Thr Asn Thr Xaa Xaa Thr Glu Thr Phe 
465 470 475 480 



Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
485 490 495 



Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly Ile Ala Pro Thr Lys
500 505 510 



Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
515 520 525 



Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
530 535 540 



Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
545 550 555 560 



Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His
565 570 575 



Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
580 585 590 



Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp
595 600 605 



Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ser
610 615 620 



Ser Trp Ser Asn Lys Ser Glu Ala Asp Ile Trp Asp Asn Met Thr Trp
625 630 635 640 



Met Gln Trp Asp Arg Glu Ile Ser Asn Tyr Thr Asn Thr Ile Tyr Arg
645 650 655 



Leu Leu Glu Asp Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp Leu
660 665 670 



Leu Glu Leu Asp Lys Trp Asn Asn Leu Trp Asn Trp Phe Asp Ile Ser
675 680 685 
Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu
690 695 700 



Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val
705 710 715 720 



Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Ser Pro
725 730 735 



Arg Gly Pro Asp Arg Leu Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln
740 745 750 



Asp Arg Asp Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ser Leu Ala
755 760 765 



Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg 
770 775 780 



Asp Phe Ile Leu Ile Ala Val Arg Ala Val Glu Leu Leu Gly His Ser
785 790 795 800 



Ser Leu Arg Gly Leu Gln Arg Gly Trp Glu Ile Leu Lys Tyr Leu Gly
805 810 815 



Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Ile Ser
820 825 830 



Leu Leu Asp Thr Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile
835 840 845 



Ile Glu Leu Val Gln Arg Ile Cys Arg Ala Ile Leu Asn Ile Pro Arg
850 855 860 



Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu Leu
865 870 875 